INDEX
Explanations
references to political figures with specific names
mentions of specific individuals, particularly those involved in politics
New Auto-Interp
Negative Logits
fertile
-0.70
brightest
-0.68
matically
-0.67
indo
-0.66
FAA
-0.66
################
-0.62
direction
-0.60
takeoff
-0.60
Flying
-0.60
latitude
-0.59
POSITIVE LOGITS
otte
1.58
borg
1.07
reau
0.90
sburg
0.89
boro
0.85
quist
0.83
ecake
0.82
nil
0.82
yre
0.80
etz
0.79
Activations Density 0.003%