INDEX
Explanations
instances of the word "you" and its variations
Negative Logits
uries      -0.18
inel       -0.17
ilton      -0.17
-tip       -0.16
ulace      -0.15
adoo       -0.15
stream     -0.15
revel      -0.15
Sav        -0.14
premises   -0.14
Positive Logits
ervo       0.16
orsch      0.16
597        0.15
Cumhur     0.14
/ns        0.14
Forgery    0.14
448        0.14
Bounding   0.13
/lists     0.13
.yahoo     0.13
Activations Density: 0.021%