Highest Voted Questions - Data Science Stack Exchange

9

votes

3 answers

Convolutional Neural Networks layer sizes

I am trying to understand an article Backpropagation In Convolutional Neural Networks But I can not wrap my head around that diagram: The first layer has 3 feature maps with dimensions 32x32. The second layer has 32 feature maps with dimensions…

neural-network cnn convolutional-neural-network convolution

asked Feb 27 '18 at 05:58

koryakinp

436
2
5
14

9

votes

1 answer

Imbalanced data causing mis-classification on multiclass dataset

I am working on text classification where I have 39 categories/classes and 8.5 million records. (In future data and categories will increase). Structure or format of my data is as…

machine-learning python classification scikit-learn multiclass-classification

asked Feb 16 '18 at 11:09

outlier

193
5

9

votes

2 answers

LSTM Feature selection process

We need to implement a time series problem with the LSTM model. But, while implementing the same, the main challenge I am facing is the feature selection issue. Because our data-set contains 2300 observations and 600 features. And we already know…

time-series feature-selection lstm

asked Feb 16 '18 at 07:20

Abhishek Saha

91
1
3

9

votes

2 answers

Understanding the mathematics of AdaGrad and AdaDelta

I have been building some models for a project, but I can't wrap my head around the math of Adagrad and Adadelta algorithms. I do understand how vanilla gradient descent works and I have written code for making it work successfully. I will be…

machine-learning gradient-descent

asked Feb 10 '18 at 13:21

Malay Hazarika

91
1
3

9

votes

2 answers

Which algorithm to apply for choosing the right point

Picture below shows 7 points around the origin. One of them has been selected by a human based on rules and experience and is colored red (the one in the lower left quadrant). Now we have over 1000 of these sets of points and for each set a human…

machine-learning

asked Feb 08 '18 at 11:04

Elmex80s

191
4

9

votes

1 answer

How to train data by batch from disk?

I am working on a convolutional neural network for image classification. The training dataset is too large to be loaded on my computer memory (4gb), on top of that I also need to try some augmentation to balance the classes. I am using keras. I have…

python keras training convolution

asked Feb 01 '18 at 21:26

Learning is a mess

646
1
8
16

9

votes

4 answers

How to get predicted class labels in convolution neural network?

I have built a convolutional neural network which is needed to classify the test data into either 0 or 1. I am training the CNN with labels either 0 or 1 but while running the below code I am getting the following result. predictions =…

machine-learning deep-learning neural-network cnn convolutional-neural-network

asked Jan 28 '18 at 17:20

LIsa

93
1
1
3

9

votes

2 answers

Input for LSTM for financial time series directional prediction

I'm working on using an LSTM to predict the direction of the market for the next day. My question concerns the input for the LSTM. My data is a financial time series $x_1 \ldots x_t$ where each $x_i$ represents a vector of features for day $i$, i.e…

machine-learning time-series lstm rnn sequence-to-sequence

asked Jan 25 '18 at 17:12

articuno

99
3

9

votes

2 answers

Using NLP to automate the categorization of user description

I have a huge file of customer complaints about the products my company owns and I would like to do a data analysis on those descriptions and tag a category to each of them. For example: I need to figure out the number of complaints on Software and…

data-mining classification nlp categorical-data

asked Dec 09 '14 at 20:49

SRS

1,065
5
11
22

9

votes

1 answer

How does YOLO algorithm detect objects if the grid size is way smaller than the object in the test image?

In YOLO algorithm how do these grids output a prediction if some grids only see a small black portion of the car if the model was trained on datasets with full images?

image-classification image-recognition object-recognition yolo

asked Jan 08 '18 at 09:09

Rishi Swethan

101
1
2

9

votes

1 answer

How to get a confidence score for predictions?

In a regression problem, is it possible to calculate a confidence/reliability score for a certain prediction given models like XGBoost or Neural Networks?

machine-learning neural-network regression decision-trees xgboost

asked Jan 06 '18 at 18:36

Henrique Nader

511
2
7
16

9

votes

1 answer

What is the BLEU score used in Google Brain's "Attention Is All You Need" paper?

Google Brain's Attention Is All You Need paper on sequence-to-sequence translation reports: Our model achieves 28.4 BLEU on the WMT 2014 Englishto-German translation task, improving over the existing best results, including ensembles, by over 2…

neural-network machine-translation

asked Dec 21 '17 at 02:29

Imran

2,381
13
22

9

votes

3 answers

How to train a xgboost model on data that is too big for the memory?

What are the best practices to train xgboost (eXtreme gradient boosting) models on data that is to big to hold it in memory at once? Splitting the data and train multiple models? Are there more elegant solutions?

python xgboost

asked Dec 03 '17 at 18:36

Soerendip

744
1
9
16

9

votes

1 answer

Understanding batch normalization

In the paper Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (here), before explaining the process of batch normalization, the paper tries to explain the issues related with (I am not getting what the…

neural-network deep-learning batch-normalization

asked Oct 30 '17 at 16:50

figs_and_nuts

903
1
5
17

9

votes

2 answers

What is the difference between concept class and hypothesis

Formal definition that I have seen of concept class is class of all true functions mathematically : $f:X \rightarrow\{0,1\}$ and that of hypothesis is: $h:X \rightarrow\{0,1\}$ But most of the times they are used together. For example in…

machine-learning

asked Oct 19 '17 at 07:30

user40687

99
1
2

Most Popular