INDEX
Explanations
mentions of people's names
references to specific individuals or entities, particularly in a political context
New Auto-Interp
Negative Logits
constitu
-0.67
geoning
-0.60
sts
-0.59
Emerald
-0.56
upload
-0.55
ioch
-0.55
manship
-0.55
ldom
-0.55
tack
-0.55
itational
-0.55
POSITIVE LOGITS
ãĥ¯ãĥ³
0.65
uese
0.65
AAF
0.64
ij士
0.63
OIL
0.63
oji
0.62
ippi
0.61
WithNo
0.61
qua
0.61
cium
0.60
Activations Density 0.176%