Most Popular

1500 questions
9
votes
1 answer

Deriving backpropagation equations "natively" in tensor form

Image shows a typical layer somewhere in a feed forward network: $a_i^{(k)}$ is the activation value of the $i^{th}$ neuron in the $k^{th}$ layer. $W_{ij}^{(k)}$ is the weight connecting $i^{th}$ neuron in the $k^{th}$ layer to the $j^{th}$ neuron…
Neil Slater
  • 29,388
  • 5
  • 82
  • 101
9
votes
2 answers

Why don't tree ensembles require one-hot-encoding?

I know that models such as random forest and boosted trees don't require one-hot encoding for predictor levels, but I don't really get why. If the tree is making a split in the feature space, then isn't there an inherent ordering involved? There…
9
votes
3 answers

Why do we use a Gaussian kernel as a similarity metric?

In graph-based clustering, why is it preferred to use the Gaussian kernel rather than the distance between two points as the similarity metric?
zfb
  • 91
  • 1
  • 1
  • 4
9
votes
3 answers

Why use convolutional NNs for a visual inspection task over classic CV template matching?

I had an interesting discussion come up based on a project we were working on: why use a CNN visual inspection system over a template matching algorithm? Background: I had shown a demo of a simple CNN vision system (webcam + laptop) that detected if…
9
votes
6 answers

NLP: What are some popular packages for multi-word tokenization?

I intend to tokenize a number of job description texts. I have tried the standard tokenization using whitespace as the delimiter. However I noticed that there are some multi-word expressions that are splitted by whitespace, which may well cause…
CyberPlayerOne
  • 392
  • 1
  • 4
  • 15
9
votes
1 answer

How does a convolutional ply differ from an ordinary convolutional network?

I am currently working on recreating the results of this paper. In the paper they describe a method for using CNN for features extraction, and have a acoustic model that is Dnn-hmm and pretrained using RBM. Section III subsection A states…
9
votes
5 answers

Perceptron learning rate

Today I've seen many Perceptron implementations with learning rates. According to Wikipedia: there is no need for a learning rate in the perceptron algorithm. This is because multiplying the update by any constant simply rescales the weights…
bkoodaa
  • 323
  • 3
  • 5
  • 8
9
votes
2 answers

Is there a C library for machine learning algorithms?

Are there any machine learning libraries for C. Specifically interested in unsupervised learning.
9
votes
1 answer

feature extraction for a pretrained model in keras

Keras has a way to extract the features of a pretrained model, described here https://keras.io/applications/ from keras.applications.vgg16 import VGG16 from keras.preprocessing import image from keras.applications.vgg16 import…
spore234
  • 613
  • 8
  • 14
9
votes
1 answer

What is the memory cost of a CNN?

I was recently thinking about the memory cost of (a) training a CNN and (b) inference with a CNN. Please note, that I am not talking about the storage (which is simply the number of parameters). How much memory does a given CNN (e.g. VGG-16 D) need…
Martin Thoma
  • 19,540
  • 36
  • 98
  • 170
9
votes
3 answers

Why do popular search engines not follow the usual AND, OR logic for queries?

I am teaching myself Information Retrieval from Christopher Manning's book (PDF link: http://nlp.stanford.edu/IR-book/pdf/01bool.pdf). I tried Exercise 1.13: "Try using the Boolean search features on a couple of major web search engines.…
user21595
9
votes
3 answers

What is the correct way to compute Mean F1 score?

I have a set of 10 experiments that compute precision, recall and f1-score for each experiment. Now, average precision & average recall is easy to compute. I have some confusion regarding average f1-score. There are 2 ways on how i can compute mean…
Pinkesh Badjatiya
  • 249
  • 1
  • 2
  • 7
9
votes
4 answers

Algorithms and techniques for spell checking

Can anyone suggest any algorithm and technique for spell checking? After some googling, I found some interesting ones such as this one from Peter Norvig, http://norvig.com/spell-correct.html and few others. However, most of them were written many…
edwin
  • 343
  • 1
  • 3
  • 10
9
votes
4 answers

Which book is a standard for introduction to genetic algorithms?

I have heard of genetic algorithms, but I have never seen practical examples and I've never got a systematic introduction to them. I am now looking for a textbook which introduces genetic algorithms in detail and gives practical examples how they…
Martin Thoma
  • 19,540
  • 36
  • 98
  • 170
9
votes
3 answers

What knowledge do I need in order to write a simple AI program to play a game?

I'm a B.Sc graduate. One of my courses was 'Introduction to Machine Learning', and I always wanted to do a personal project in this subject. I recently heard about different AI training to play games such as Mario, Go, etc. What knowledge do I need…