INDEX
Explanations
names of individuals, possibly in a list or sequence
proper nouns, especially names and titles
New Auto-Interp
Negative Logits
bryce
-0.68
Pwr
-0.62
osterone
-0.61
Roundup
-0.60
Skydragon
-0.59
captivity
-0.57
disson
-0.55
yourselves
-0.54
prost
-0.54
plaque
-0.53
POSITIVE LOGITS
ilver
0.83
vu
0.80
hua
0.72
inen
0.72
edu
0.67
direction
0.67
zee
0.65
ilan
0.64
vre
0.64
fecture
0.64
Activations Density 0.145%