INDEX
Explanations
instances of the word "the" or phrases with significant occurrences of "the"
Negative Logits
wikipagina    -0.77
Sodom         -0.71
Esau          -0.69
Euripides     -0.67
Fascism       -0.66
secondly      -0.65
zelve         -0.65
tiennent      -0.64
Shakspeare    -0.63
hereof        -0.62
Positive Logits
entire        1.28
same          1.25
most          1.07
whole         1.06
latter        1.00
last          0.96
vast          0.96
majority      0.96
final         0.95
")));         0.95
Activations Density: 0.954%