INDEX
Explanations
the word "are" in various contexts
the word "are," indicating emphasis on being or existence
New Auto-Interp
Negative Logits
omez
-0.69
ingen
-0.67
uration
-0.65
imating
-0.61
oting
-0.59
allery
-0.59
ured
-0.59
ertodd
-0.59
imation
-0.58
OOL
-0.58
POSITIVE LOGITS
nce
1.03
nces
1.01
tsky
1.00
tto
0.97
nda
0.88
tta
0.87
nt
0.86
nd
0.83
zza
0.82
lli
0.81
Activations Density 0.019%