INDEX
Explanations
instances of the word "take" in the text
New Auto-Interp
Negative Logits
gren
-0.18
gth
-0.15
igar
-0.15
Profes
-0.15
obel
-0.15
anik
-0.14
omor
-0.14
imon
-0.14
onte
-0.14
_MR
-0.14
POSITIVE LOGITS
ldb
0.15
ead
0.14
ç´Ģ
0.14
OSP
0.14
Tato
0.14
_modes
0.14
punct
0.14
incl
0.14
strained
0.13
strate
0.13
Activations Density 0.000%