INDEX
Explanations
words related to political and governmental contexts
references related to administrative or governmental topics
New Auto-Interp
Negative Logits
decomp
-0.97
Codec
-0.73
gib
-0.71
hob
-0.69
pile
-0.67
silhou
-0.67
sacrific
-0.67
recording
-0.67
parachute
-0.67
sandwich
-0.66
POSITIVE LOGITS
£
0.94
¢
0.93
Ħ¢
0.90
¬
0.90
ı
0.87
ates
0.87
º
0.85
§
0.85
lege
0.83
ħ
0.83
Activations Density 0.376%