Explanations
instances of the word "a" in various contexts
Negative Logits
Sucesor      -0.69
vorticity    -0.60
insec        -0.59
poussière    -0.59
Israël       -0.58
antenn       -0.57
idiota       -0.57
iguana       -0.56
autorité     -0.56
indiqué      -0.55
Positive Logits
a            1.17
few          1.09
large        1.03
different    1.02
great        1.01
larger       0.99
{}",0.94
new          0.93
huge         0.93
very         0.91
Activations Density 1.204%