INDEX
Explanations
mentions of the name "Thor."
Negative Logits
Token     Logit
ropoda    -0.17
zer       -0.16
archy     -0.15
Ø´ÛĮ      -0.15
gor       -0.14
è©ķ       -0.14
ائÙħ      -0.14
erman     -0.14
ÑĤ        -0.14
ivre      -0.14
Positive Logits
Token     Logit
acic      0.32
nton      0.32
OUGH      0.24
arin      0.21
bj        0.20
wald      0.20
sten      0.19
Thor      0.19
azine     0.18
stein     0.18
Activations Density 0.003%
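
The logit lists and the density figure above are reported without saying how they were computed. The sketch below shows one common convention for dashboards of this kind, and it is an assumption here, not something the page states: the feature's decoder direction is projected through the model's unembedding matrix, the most negative and most positive entries become the two token lists, and density is the fraction of tokens on which the feature is non-zero. The names `top_logit_effects`, `activation_density`, `decoder_dir`, `unembed`, and `vocab` are hypothetical, and the numbers are toy values rather than this feature's data.

```python
import numpy as np

def top_logit_effects(decoder_dir, unembed, vocab, k=10):
    """Return the k most negative and k most positive (token, effect) pairs.

    Assumption: a token's effect is the dot product of the feature's
    decoder direction with that token's unembedding column.
    """
    effects = np.asarray(decoder_dir) @ np.asarray(unembed)  # shape: (vocab_size,)
    order = np.argsort(effects)                               # ascending
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return negative, positive

def activation_density(activations):
    """Fraction of tokens on which the feature activation is non-zero."""
    activations = np.asarray(activations)
    return float(np.count_nonzero(activations)) / activations.size

# Toy data (hypothetical): a 2-dimensional model with a 3-token vocabulary.
vocab = ["a", "b", "c"]
decoder_dir = [1.0, 0.0]
unembed = [[0.3, -0.2, 0.1],
           [0.5, 0.5, 0.5]]

neg, pos = top_logit_effects(decoder_dir, unembed, vocab, k=1)
print("most negative:", neg)
print("most positive:", pos)
print("density:", activation_density([0.0, 0.0, 0.0, 2.5]))
```

Under this convention, a density of 0.003% would mean the feature is non-zero on roughly 3 of every 100,000 tokens in whatever corpus was sampled.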