INDEX
Explanations
entities related to government and political figures, particularly in the context of decisions or controversies
possessive forms indicating ownership or association
Negative Logits
raviolet    -0.69
gist        -0.65
awaru       -0.65
ebus        -0.61
rhy         -0.61
KC          -0.60
odium       -0.60
pressures   -0.60
carbohyd    -0.60
merce       -0.60
Positive Logits
女            1.00
ï¸ı          0.89
Ļ            0.88
ãĥ¥          0.86
éĩ           0.83
¯¯           0.83
ħ            0.81
SHIP         0.80
Ùħ           0.80
âĢ¢âĢ¢âĢ¢âĢ¢  0.80
Activations Density 0.526%
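
Several entries in the positive logits list (for example ãĥ¥ and Ùħ) look like the printable stand-ins that GPT-2-style byte-level BPE tokenizers use for raw bytes, not like ordinary text. The sketch below assumes that is what they are, which the page itself does not confirm. It rebuilds the byte-to-character table used by GPT-2's tokenizer and maps such a string back to UTF-8. The helper names `gpt2_byte_table` and `decode_token` are hypothetical, chosen for illustration.

```python
def gpt2_byte_table() -> dict[int, str]:
    """Byte -> printable character table used by GPT-2's byte-level BPE."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to code point 256 + n, in byte order.
    n = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + n)
            n += 1
    return table


CHAR_TO_BYTE = {ch: b for b, ch in gpt2_byte_table().items()}


def decode_token(display: str) -> str:
    """Map a byte-level display string back to text.

    Assumption: strings containing characters outside the table (such as 女)
    are already decoded, so they are returned unchanged.
    """
    if any(ch not in CHAR_TO_BYTE for ch in display):
        return display
    raw = bytes(CHAR_TO_BYTE[ch] for ch in display)
    # Tokens can end mid-character; incomplete sequences become U+FFFD.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ¥", "Ùħ", "SHIP", "女"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under this assumption, ãĥ¥ corresponds to the bytes E3 83 A5 (the katakana ュ) and Ùħ to D9 85 (the Arabic letter م). Entries such as éĩ decode to an incomplete UTF-8 sequence, which suggests the token is only a fragment of a longer character.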