INDEX
Explanations
statements or questions expressing disbelief or incredulity
expressions of disbelief or incredulity
New Auto-Interp
Negative Logits
catentry
-0.94
lez
-0.76
Runner
-0.70
ipple
-0.64
Ripple
-0.63
skeleton
-0.62
Survivors
-0.61
ateral
-0.61
Graveyard
-0.60
aina
-0.59
POSITIVE LOGITS
suddenly
0.80
ammed
0.80
DonaldTrump
0.74
anything
0.73
knowingly
0.73
anyone
0.72
blindly
0.71
anybody
0.70
otherwise
0.69
bothered
0.68
Activations Density 0.249%