INDEX
Explanations
instances of the phrase "you are" in varying contexts
Negative Logits
Token       Logit
etails      -0.17
ÎIJ         -0.17
$MESS       -0.16
eners       -0.16
agon        -0.15
wner        -0.15
èĤ          -0.15
rada        -0.15
pector      -0.14
ãĤŃãĥ¼      -0.14
Positive Logits
Token       Logit
ormsg       0.16
overe       0.15
Boom        0.15
fat         0.15
Zug         0.14
ana         0.14
Sam         0.14
Ages        0.13
231         0.13
Brian       0.13
Activations Density: 0.021%