INDEX
Explanations
characters or symbols associated with emoticons or special characters
various symbols and special characters
New Auto-Interp
Negative Logits
independ
-0.67
Accord
-0.66
Militia
-0.64
suicide
-0.64
applic
-0.63
suspensions
-0.61
delegates
-0.61
simultane
-0.61
Virgin
-0.60
Creation
-0.60
POSITIVE LOGITS
ł
2.09
©
2.07
Ģ
2.05
Ĩ
2.04
¸
2.03
¹
2.03
ĥ
2.03
Ķ
2.02
ī
2.02
Ĭ
2.02
Activations Density 0.031%