Explanations
dates and proper nouns related to locations
mentions of air force operations or military activities
Negative Logits
amplification         -0.58
listener              -0.56
SHIP                  -0.53
WT                    -0.52
Trails                -0.49
isSpecialOrderable    -0.49
trial                 -0.49
incentive             -0.49
cest                  -0.48
anonymously           -0.48
Positive Logits
zona      0.67
pora      0.65
womb      0.64
oola      0.63
è£ıè      0.59
Coffin    0.55
rays      0.55
obl       0.53
Jr        0.53
aviour    0.53
Activations Density: 1.419%