INDEX
Explanations
references to political offices or official positions
references to various "offices" associated with individuals or governmental entities
Negative Logits
══          -0.80
oise        -0.78
iser        -0.70
ï¸          -0.68
Haram       -0.67
═           -0.66
MQ          -0.66
Laksh       -0.63
ushima      -0.63
velength    -0.63
Positive Logits
holders     1.16
holder      1.07
tops        0.93
holder      0.88
mate        0.81
holders     0.80
matical     0.79
room        0.78
press       0.77
bearer      0.76
Activations Density 0.023%