INDEX
Explanations
names and places that seem to be in German
words and phrases related to names or identities
New Auto-Interp
Negative Logits
prem
-0.66
Luffy
-0.63
Mayweather
-0.63
Bucc
-0.59
warrant
-0.58
BCC
-0.58
Warrant
-0.58
whistle
-0.58
pokemon
-0.58
uously
-0.56
POSITIVE LOGITS
chen
1.11
lein
1.06
tera
1.00
ische
0.99
nel
0.98
ner
0.98
lich
0.97
vre
0.96
ste
0.96
sel
0.92
Activations Density 0.084%