Explanations
instances of the word "shows" and related verbs indicating presentation or demonstration of information
Negative Logits
hlen        -0.08
اسب         -0.07
oku         -0.07
eman        -0.07
uchs        -0.07
Pence       -0.07
油          -0.07
ordan       -0.06
ominator    -0.06
brahim      -0.06
Positive Logits
hoa         0.07
promise     0.07
.Deep       0.07
itself      0.07
little      0.06
especially  0.05
Little      0.05
clearly     0.05
pires       0.05
cref        0.05
Activations Density 0.022%