INDEX
Explanations
phrases related to being caught or caught in a conflict or situation
New Auto-Interp
Negative Logits
sburgh
-0.64
die
-0.61
arten
-0.61
bard
-0.61
pronoun
-0.60
princip
-0.60
inburgh
-0.59
sclerosis
-0.59
edly
-0.58
AMY
-0.57
POSITIVE LOGITS
phrase
1.06
netflix
0.82
tails
0.78
glimps
0.78
izoph
0.74
doors
0.73
bait
0.72
Luffy
0.71
tail
0.71
unprepared
0.70
Activations Density 0.782%