INDEX
Explanations
human body parts
references to people, organizations, or specific identifiers
Negative Logits
fig          -0.73
vine         -0.71
lance        -0.70
bestos       -0.70
Fet          -0.69
paren        -0.68
ribune       -0.63
glomer       -0.63
TPS          -0.62
utherland    -0.62
Positive Logits
suffers      0.87
prefers      0.84
underwent    0.83
enjoys       0.82
engaged      0.81
's           0.80
agrees       0.80
knew         0.80
undertook    0.79
possesses    0.79
Activations Density: 0.824%