INDEX
Explanations
proper nouns, particularly names and locations
Negative Logits
Token        Logit
Torrent      -0.70
Mechdragon   -0.67
Viz          -0.63
ion          -0.63
lov          -0.59
Jackets      -0.58
Brut         -0.57
Amir         -0.57
mith         -0.57
Pedro        -0.57
Positive Logits
Token        Logit
sie          0.63
analy        0.62
okes         0.61
ruit         0.61
entity       0.60
��           0.59
yg           0.58
opot         0.58
OHN          0.57
Cola         0.57
Activations Density: 1.297%