INDEX
Explanations
instances where the word "In" is capitalized and followed by some content that follows a specific structure or template
the word "In" at the beginning of various statements or contexts
New Auto-Interp
Negative Logits
ages
-0.67
pudding
-0.66
goodbye
-0.65
yes
-0.65
mouths
-0.64
stra
-0.64
fuck
-0.64
plat
-0.64
swe
-0.63
smooth
-0.63
POSITIVE LOGITS
In
2.63
During
1.98
On
1.86
According
1.80
When
1.79
Through
1.78
After
1.76
At
1.75
ccording
1.75
Since
1.74
Activations Density 0.067%