INDEX
Explanations
words related to properties or processes
references to the concept of "pro" or professionalism
New Auto-Interp
Negative Logits
needles
-0.74
gow
-0.71
spears
-0.70
Twain
-0.70
Cornell
-0.68
Dwell
-0.67
Halls
-0.67
woodland
-0.67
Graveyard
-0.66
bodied
-0.66
POSITIVE LOGITS
digy
1.25
blems
1.15
secut
1.09
posal
1.06
secution
1.03
gressive
1.01
ceed
0.98
gression
0.96
hibited
0.93
verbs
0.93
Activations Density 0.015%