INDEX
Explanations
names of people or characters
proper nouns, particularly names of individuals
Negative Logits
theless     -0.75
ModLoader   -0.74
underwater  -0.73
âĶĢâĶĢ      -0.71
anwhile     -0.69
etheless    -0.68
LEASE       -0.65
Melania     -0.65
ãĥŁ         -0.64
Galileo     -0.64
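Two entries in the list above (`âĶĢâĶĢ` and `ãĥŁ`) look like tokens printed in a byte-level BPE vocabulary's internal alphabet rather than as readable text. The sketch below shows one way to map such strings back to UTF-8, assuming the dashboard uses a GPT-2-style byte-to-unicode table; the source does not name its tokenizer, so that is an assumption, and the helper names `gpt2_byte_decoder` and `decode_token` are hypothetical.

```python
def gpt2_byte_decoder():
    """Build the inverse of the GPT-2 byte-to-unicode table (display char -> raw byte).

    Assumption: the dashboard's tokenizer uses this GPT-2-style mapping.
    """
    # Bytes that are kept as themselves because they are already printable.
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {chr(b): b for b in keep}
    # Every other byte is shifted to the code points starting at 256, in order.
    n = 0
    for b in range(256):
        if b not in keep:
            table[chr(256 + n)] = b
            n += 1
    return table


def decode_token(token: str) -> str:
    """Turn a byte-level token string back into readable UTF-8 text."""
    table = gpt2_byte_decoder()
    raw = bytes(table[ch] for ch in token)
    # Tokens can end mid-character, so tolerate incomplete UTF-8 sequences.
    return raw.decode("utf-8", errors="replace")


for tok in ["âĶĢâĶĢ", "ãĥŁ", "Galileo"]:
    print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `âĶĢâĶĢ` decodes to two box-drawing characters (`──`) and `ãĥŁ` to the katakana `ミ`, while plain ASCII tokens such as `Galileo` pass through unchanged.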
Positive Logits
atz     1.02
ovich   1.01
lett    1.00
itz     1.00
acci    0.98
zen     0.97
inger   0.95
inski   0.95
owski   0.93
ansky   0.93
Activations Density 0.306%