INDEX
Explanations
instances of the word "only."
New Auto-Interp
Negative Logits
elijk
-0.15
roperties
-0.15
isher
-0.14
ëģĶ
-0.14
ively
-0.14
(es
-0.14
ạch
-0.14
484
-0.14
iosk
-0.14
841
-0.13
POSITIVE LOGITS
íģ¼
0.17
sandbox
0.17
лиÑĪÑĮ
0.16
Fans
0.15
Brave
0.15
nda
0.14
odie
0.14
Badge
0.14
verture
0.14
baÅŁÄ±na
0.14
Activations Density 0.088%