INDEX
Explanations
forms of the verb "to be" in various tenses
New Auto-Interp
Negative Logits
Frazier
-0.74
sqor
-0.69
iggins
-0.66
rities
-0.64
onds
-0.64
LIN
-0.63
Awakens
-0.62
attest
-0.61
yip
-0.61
Ingredients
-0.61
POSITIVE LOGITS
forth
0.97
able
0.93
chosen
0.90
singled
0.82
supposed
0.80
bothered
0.79
compelled
0.79
named
0.78
bothering
0.77
drawn
0.76
Activations Density 0.069%