INDEX
Explanations
phrases or sentences indicating someone was caught doing something, often with negative connotations
instances of the word "caught" in various contexts
New Auto-Interp
Negative Logits
pronoun
-0.77
pronouns
-0.75
helicop
-0.70
umably
-0.67
livest
-0.67
edom
-0.66
reluct
-0.66
anwhile
-0.66
persuasion
-0.65
hov
-0.65
POSITIVE LOGITS
ãĤµ
0.85
caught
0.75
Rai
0.75
ãĤ¤
0.74
glimps
0.72
Frames
0.72
phrase
0.72
aught
0.72
Luffy
0.70
netflix
0.70
Activations Density 0.016%