Explanations
words related to people's names
the presence of a specific name or term repeatedly mentioned in various contexts
Negative Logits
GGGGGGGG     -0.72
Bucks        -0.69
Yemeni       -0.63
Somalia      -0.61
é¾įå         -0.60
Belgium      -0.60
Ethiopia     -0.60
Axis         -0.59
Äĩ           -0.58
Macedonia    -0.58
Positive Logits
apeake       1.22
terday       1.19
creen        1.08
hire         0.94
chool        0.92
ema          0.87
gemony       0.87
hirt         0.87
worth        0.85
chel         0.84
Activations Density 0.014%