INDEX
Explanations
phrases related to the general status or condition of things/events
phrases that express change or the state of affairs
Negative Logits
lication      -0.76
pora          -0.75
Joined        -0.69
lees          -0.69
obook         -0.68
nor           -0.66
descriptor    -0.65
ELD           -0.63
ledge         -0.62
odied         -0.62
Positive Logits
downhill      0.98
spir          0.88
unravel       0.76
escalated     0.76
unfolded      0.75
smoothly      0.74
transpired    0.74
MpServer      0.72
Spiral        0.71
cov           0.71
Activations Density 0.196%