INDEX
Explanations
words related to endings or conclusions
NEGATIVE LOGITS
isible     -0.19
tslib      -0.17
inerary    -0.16
forme      -0.16
atables    -0.15
imenti     -0.15
ieme       -0.15
atform     -0.15
ledged     -0.15
á»Ļn       -0.14
POSITIVE LOGITS
wind       0.23
ow         0.21
Wind       0.20
ended      0.20
up         0.19
wound      0.19
us         0.19
ear        0.18
Wind       0.18
wind       0.18
Activations Density: 0.011%