INDEX
Explanations
proper names starting with "Van"
the mention of the name "Van" or related individuals
Negative Logits
Token       Logit
ĪĴ          -0.79
ilial       -0.79
ãģĨ         -0.75
reluct      -0.73
ij士         -0.72
inally      -0.71
ĺħ          -0.67
ometimes    -0.66
ocene       -0.66
earch       -0.65
Positive Logits
Token       Logit
Gaal        1.04
quished     0.96
Hels        0.90
ishing      0.85
illa        0.84
adium       0.84
rol         0.80
kel         0.79
neys        0.79
ross        0.78
Activations Density: 0.009%