INDEX
Explanations
affirmative and future-oriented statements
New Auto-Interp
Negative Logits
oba
-0.07
sik
-0.07
ROUT
-0.06
komp
-0.06
Hope
-0.06
udu
-0.06
udem
-0.06
uta
-0.06
enou
-0.06
bio
-0.06
POSITIVE LOGITS
utzer
0.07
excellent
0.07
excell
0.07
toe
0.06
soon
0.06
fare
0.06
'..',
0.06
great
0.06
success
0.06
equally
0.06
Activations Density 0.018%