INDEX
Explanations
proper nouns of individuals
proper names, specifically individuals
Negative Logits
âĢº              -0.77
......           -0.67
..........       -0.65
_-_              -0.59
ãĥ©ãĥ³           -0.57
âĢİ              -0.56
arial            -0.56
�                -0.55
Reloaded         -0.54
.............    -0.53
Positive Logits
owan             0.55
ensibly          0.53
ritical          0.52
reth             0.51
vetoed           0.50
muse             0.49
addafi           0.47
illus            0.46
mun              0.46
ussion           0.46
Activations Density 0.974%