INDEX
Explanations
words related to ranks or titles
punctuation and stylized text formats
New Auto-Interp
Negative Logits
IDENT
-0.64
aminer
-0.63
Poll
-0.63
IRC
-0.62
Bomber
-0.62
Alert
-0.60
velt
-0.59
Platform
-0.58
LECT
-0.58
TPP
-0.57
POSITIVE LOGITS
a
0.87
o
0.72
ahs
0.67
acia
0.67
ta
0.65
Ãł
0.65
és
0.64
ataka
0.64
nen
0.61
dds
0.60
Activations Density 0.051%