INDEX
Explanations
references to emotional experiences or situations
New Auto-Interp
Negative Logits
+#+
-0.72
springfox
-0.70
клопе
-0.67
UserScript
-0.65
DockStyle
-0.63
NOPQRST
-0.61
AssemblyCulture
-0.60
WebElementEntity
-0.60
انيف
-0.58
település
-0.57
POSITIVE LOGITS
kinda
0.63
bigliamento
0.63
dunno
0.61
definitely
0.58
literally
0.57
Literally
0.57
actually
0.56
Literally
0.54
gotta
0.54
doveva
0.54
Activations Density 0.081%