Explanations
occurrences of the word "thinking" and its variations
Negative Logits
lero    -0.08
legate  -0.07
agna    -0.06
ppard   -0.06
akest   -0.06
nejd    -0.06
ì§Ī     -0.06
ëŁŃ     -0.06
aign    -0.06
ÏĦαι    -0.06
Positive Logits
Outside    0.07
outside    0.07
cap        0.07
ÐĴÐŀ       0.07
Bout       0.06
cap        0.06
ach        0.06
about      0.06
è¿Ľ        0.06
iability   0.06
Activations Density: 0.005%