INDEX
Explanations
fighter aircraft
references to fighter jets
New Auto-Interp
Negative Logits
andre
-0.71
opia
-0.70
imeter
-0.70
ories
-0.70
ologies
-0.68
ologically
-0.68
eus
-0.67
Ceres
-0.67
imated
-0.66
APS
-0.65
POSITIVE LOGITS
jets
1.16
jet
1.08
squadron
0.88
pilots
0.88
Squadron
0.87
Typhoon
0.84
fighting
0.82
fighter
0.79
fighters
0.78
riors
0.77
Activations Density 0.055%