INDEX
Explanations
proper nouns, primarily names and brands
Negative Logits
Token            Logit
httphttps        -0.86
verwijspagina    -0.78
pinulongan       -0.70
ssaint           -0.69
"")              -0.69
المعيارى         -0.67
aarrggbb         -0.66
"");             -0.64
SSON             -0.63
abelle           -0.61
Positive Logits
Token            Logit
myſelf           1.06
itſelf           1.02
Monfieur         0.97
Majefty          0.91
Cæsar            0.91
Houſe            0.84
Anſ              0.84
ſelves           0.83
ſmall            0.81
Efq              0.81
Activations Density: 0.488%