INDEX
Explanations
instances where individuals are physically confined or immobilized
instances of the word "trapped" in various contexts
New Auto-Interp
Negative Logits
issance
-0.92
sych
-0.92
sburgh
-0.75
itage
-0.74
roxy
-0.72
ific
-0.69
resso
-0.68
eds
-0.66
nings
-0.65
frey
-0.65
POSITIVE LOGITS
trapped
0.81
souls
0.78
DonaldTrump
0.75
Quantity
0.74
untarily
0.74
pigeon
0.73
ravel
0.70
Labyrinth
0.69
©¶æ¥µ
0.68
ducks
0.68
Activations Density 0.034%