INDEX
Explanations
references to direct speech involving the word "You" in different contexts
direct address or references to "you."
Negative Logits
¿½            -0.98
£ı            -0.75
tains         -0.70
ipal          -0.69
Commerce      -0.68
enges         -0.65
Additional    -0.65
ŃĶ            -0.64
20439         -0.64
abin          -0.64
Positive Logits
're           1.49
guys          1.30
gotta         1.26
bastard       1.25
deserve       1.21
've           1.17
wanna         1.15
idiots        1.14
owe           1.12
idiot         1.12
Activations Density 0.130%