INDEX
Explanations
proper nouns
specific letters and letter combinations that frequently appear in the text
Negative Logits
²¾              -0.66
ãĥij             -0.63
·               -0.62
د               -0.62
ĩ               -0.60
«               -0.59
Ship            -0.58
IJ              -0.57
ĸ               -0.57
Bridge          -0.57
Positive Logits
.—              0.62
ðŁĺ             0.60
ðŁĻĤ            0.57
supers          0.53
jail            0.50
chant           0.49
dominated       0.49
ASAP            0.47
cham            0.47
indefinitely    0.47
Activations Density: 1.489%