Useful tips for everyday
How is model quality measured? To evaluate your model’s quality, commonly-used metrics are: loss. accuracy. precision & recall. area under the ROC curve (AUC) What…
Why do you need to split you dataset into a train set and a test set? The reason is that when the dataset is split…
What is accuracy in multi-label classification? Accuracy classification score. In multilabel classification, this function computes subset accuracy: the set of labels predicted for a sample…
How does a BERT tokenizer work? The BERT model receives a fixed length of sentence as input. Usually the maximum length of a sentence depends…
How do you create a column in DataFrame based on another column? Create New Columns in Pandas DataFrame Based on the Values of Other Columns…
Is overfitting possible in linear regression? Regression. In regression analysis, overfitting occurs frequently. As an extreme example, if there are p variables in a linear…
What are rules in Apriori? The Apriori algorithm calculates rules that express probabilistic relationships between items in frequent itemsets For example, a rule derived from…
How do you generalize a model? Generalization refers to your model’s ability to adapt properly to new, previously unseen data, drawn from the same distribution…
How do you know if Random Forest is overfitting? The Random Forest algorithm does overfit. The generalization error variance is decreasing to zero in the…
How can I improve my ner model? In order from biggest potential for improvement to lowest potential: annotate NER training data, especially if your custom…