INDEX
Explanations
phrases related to military operations and political actions
New Auto-Interp
Negative Logits
chrom
-0.68
cube
-0.62
ovi
-0.59
Finch
-0.58
pedia
-0.57
backdrop
-0.57
女
-0.56
Slate
-0.56
Compass
-0.56
Sunshine
-0.54
POSITIVE LOGITS
upon
1.05
upon
0.85
eous
0.79
on
0.77
itect
0.74
On
0.71
Upon
0.70
ently
0.70
rieg
0.69
azard
0.68
Activations Density 0.082%