INDEX
Explanations
the word "find" and its variations across different contexts
Negative Logits
odic      -0.15
ubern     -0.15
er        -0.15
von       -0.14
vrier     -0.14
umm       -0.14
sted      -0.14
veau      -0.13
ongo      -0.13
stitial   -0.13
Positive Logits
ائÙĦ      0.15
änn       0.15
touch     0.15
661       0.15
McN       0.14
lernen    0.14
ç¾        0.14
boot      0.14
ÏĦια      0.14
iless     0.14
Activations Density 0.004%