Explanations
proper nouns, particularly names of characters and places
Negative Logits
ilst        -0.16
ÑģÑĥÑĤ      -0.15
apsed       -0.15
.Modules    -0.14
ray         -0.14
omics       -0.14
imbus       -0.14
rays        -0.14
kus         -0.14
ebi         -0.14
Positive Logits
Society     0.25
society     0.22
societal    0.17
ãĥªãĥ¼ãĤº    0.17
gente       0.16
scandal     0.16
gentlemen   0.15
Reform      0.15
Lord        0.15
soc         0.15
Activation Density: 0.043%