INDEX
Explanations
past and present participles used as descriptors
Negative Logits
lies        -0.77
Relief      -0.71
Must        -0.69
izable      -0.68
Needs       -0.68
ives        -0.64
rones       -0.63
terday      -0.63
warts       -0.62
Solution    -0.62
Positive Logits
subjected    1.03
criticized   0.99
likened      0.98
replaced     0.94
unable       0.92
able         0.92
avering      0.91
taken        0.91
criticised   0.90
cffffcc      0.89
Activations Density: 0.799%