INDEX
Explanations
references to the character "Captain America" or related terms
references to the character Captain America
New Auto-Interp
Negative Logits
ucl
-0.67
exch
-0.67
numbered
-0.67
ocene
-0.66
itual
-0.64
Neigh
-0.64
tsy
-0.60
upon
-0.60
aeda
-0.60
choice
-0.59
POSITIVE LOGITS
cy
1.17
cies
0.98
esses
0.91
Kidd
0.86
berries
0.84
Picard
0.80
istics
0.77
Phillips
0.76
berry
0.76
ials
0.74
Activations Density 0.031%