INDEX
Explanations
proper nouns and names, potentially related to locations or entities
punctuation, specifically commas and periods
New Auto-Interp
Negative Logits
zing
-0.68
¦
-0.66
worldly
-0.66
¯
-0.66
:[
-0.65
ãĥ¼
-0.63
ĸ
-0.63
verage
-0.62
IJ
-0.60
't
-0.60
POSITIVE LOGITS
dominates
1.04
arrives
1.00
appears
0.99
became
0.97
was
0.97
has
0.97
belongs
0.97
appeared
0.96
resides
0.96
earns
0.96
Activations Density 0.292%