INDEX
Explanations
phrases related to negative outcomes or consequences
the past participle forms of verbs
New Auto-Interp
Negative Logits
compress
-0.69
relocation
-0.66
hiber
-0.66
crunch
-0.65
withdrawal
-0.65
prediction
-0.65
Maid
-0.63
annexation
-0.63
metic
-0.62
limp
-0.62
POSITIVE LOGITS
aught
1.02
lain
0.86
minded
0.85
erer
0.85
baugh
0.85
alone
0.85
rings
0.78
erers
0.78
unes
0.78
rying
0.78
Activations Density 0.026%