INDEX
Explanations
proper nouns with quotation marks around them
quotation marks and their usage in the text
New Auto-Interp
Negative Logits
Ͻ
-0.96
ĻĤ
-0.80
¿
-0.78
¾
-0.76
ailing
-0.74
aults
-0.73
stant
-0.70
¸
-0.70
ousse
-0.68
ushi
-0.67
POSITIVE LOGITS
/"
1.35
moniker
0.89
appell
0.76
designation
0.74
aka
0.74
mantra
0.71
motto
0.71
label
0.70
mentality
0.69
("0.68
Activations Density 0.102%