INDEX
Explanations
references to statements and quotes made by individuals
New Auto-Interp
Negative Logits
[];
-0.77
الرياضيه
-0.71
[],
-0.70
}*/
-0.68
.*;
-0.65
виправивши
-0.64
?>/
-0.64
"},
-0.63
"){
-0.62
++
-0.62
POSITIVE LOGITS
obé
0.58
said
0.57
sobbed
0.53
honte
0.53
replied
0.53
said
0.53
vocato
0.52
ضب
0.52
paksa
0.50
ništ
0.49
Activations Density 0.045%