INDEX
Explanations
references to specific objects or physical locations
New Auto-Interp
Negative Logits
idates
-0.82
furthermore
-0.79
rity
-0.78
erity
-0.76
moreover
-0.73
icals
-0.71
additionally
-0.69
hess
-0.69
fficient
-0.67
iencies
-0.66
POSITIVE LOGITS
proverbial
0.86
sponge
0.86
steroids
0.78
miniature
0.77
magnet
0.75
Gest
0.74
aspirin
0.73
yip
0.73
heartbeat
0.72
spaghetti
0.72
Activations Density 1.655%