Explanations
proper nouns representing individuals or groups
instances of the verb "to be" in various forms
Negative Logits
Token       Logit
ampa        -0.66
ennett      -0.65
.>>         -0.63
owa         -0.63
rers        -0.62
STER        -0.61
lly         -0.60
.;          -0.58
omsday      -0.58
mens        -0.57
Positive Logits
Token           Logit
formerly        1.10
originally      1.05
previously      1.04
wolves          1.00
hes             1.00
supposed        0.95
instrumental    0.91
initially       0.89
conceived       0.89
born            0.86
Activations Density 0.176%
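
As a rough illustration of the density figure, the sketch below assumes that "activations density" means the fraction of token positions on which the feature fires, that is, has an activation above zero. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration and are not taken from the page itself. Under that reading, 0.176% corresponds to about 176 active tokens per 100,000.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density is counted over individual token positions.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 176 active positions out of 100,000 tokens.
sample = [1.0] * 176 + [0.0] * (100_000 - 176)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.176%"
```

A feature this sparse is silent on the vast majority of tokens, which is why the logit tables above are read together with the density when judging what the feature responds to.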