INDEX
Explanations
mentions of personal interests and hobbies
hobbies and interests
New Auto-Interp
Negative Logits
RegressionTest
-0.55
Italij
-0.41
hotelu
-0.39
SequentialGroup
-0.36
keamanan
-0.36
politiet
-0.35
saldır
-0.34
bancada
-0.34
distance
-0.34
opération
-0.33
POSITIVE LOGITS
hobby
0.89
hobbies
0.88
Hobbies
0.83
Hobbies
0.82
Hobby
0.81
Hobby
0.75
AutoField
0.75
hobby
0.75
Interests
0.69
surla
0.69
Activations Density 0.097%