INDEX
Explanations
instances of the word "For" in the text
phrase constructions that introduce examples or instances
New Auto-Interp
Negative Logits
ownt
-0.64
deserves
-0.62
itiz
-0.62
izzle
-0.57
forg
-0.56
beg
-0.56
pron
-0.56
è¦ļéĨĴ
-0.55
illin
-0.55
Eat
-0.55
POSITIVE LOGITS
example
1.47
cing
1.31
instance
1.28
gotten
1.19
bidden
1.16
ced
1.16
starters
1.08
got
1.03
comparison
1.02
Example
1.00
Activations Density 0.060%