INDEX
Explanations
names of specific individuals
proper nouns, particularly names of individuals and locations
New Auto-Interp
Negative Logits
âĸ¬
-0.91
âķIJâķIJ
-0.74
PID
-0.68
HCR
-0.68
boarding
-0.66
WARN
-0.64
CPC
-0.64
Miranda
-0.63
Gravity
-0.63
Haas
-0.63
POSITIVE LOGITS
yssey
1.06
ghan
0.92
Og
0.89
uchi
0.89
oby
0.86
acity
0.86
regor
0.85
rib
0.85
acious
0.84
awa
0.84
Activations Density 0.007%