Useful tips for everyday
Why is elastic net better than lasso and Ridge? Lasso will eliminate many features, and reduce overfitting in your linear model. Ridge will reduce the…
How do you find the dimensions of a matrix product? Size of a matrix = number of rows × number of columns. It can be…
Why use VAE instead of ae? A deep neural VAE is quite similar in architecture to a regular AE. The main difference is that the…
How many standard deviations is an outlier? Three standard deviations Three standard deviations from the mean is a common cut-off in practice for identifying outliers…
What is vanishing gradient problem? Vanishing gradients is a particular problem with recurrent neural networks as the update of the network involves unrolling the network…
What is maximum margin hyperplane in SVM? The best or optimal line that can separate the two classes is the line that as the largest…
How do you deal with a skewed distribution? Dealing with skew data: log transformation: transform skewed distribution to a normal distribution. Remove outliers. Normalize (min-max)…
What are the features of reinforcement learning? In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the…
What does a high root mean square error mean? If the noise is small, as estimated by RMSE, this generally means our model is good…
What is autoregressive flow? Neural Autoregressive Flows provide a way to combine expressive transformations with tractable changes in probability distributions. To simulate random samples from…