INDEX
Explanations
words related to personal opinions and evaluations
New Auto-Interp
Negative Logits
INSEE
-0.65
Hamlin
-0.61
menti
-0.61
Drapeau
-0.60
Mela
-0.60
kém
-0.59
rsiniz
-0.59
м
-0.55
"..\..\
-0.55
okovic
-0.55
POSITIVE LOGITS
been
1.21
تانيه
1.20
للمعارف
0.95
gotta
0.92
been
0.89
0.88
)’
0.84
gonna
0.83
got
0.80
itudinal
0.80
Activations Density 0.251%