INDEX
Explanations
phrases related to actions and capabilities
phrases indicating conditional situations or consequences
New Auto-Interp
Negative Logits
raft
-0.77
¯
-0.73
mx
-0.71
ode
-0.67
neg
-0.65
owl
-0.64
dds
-0.64
uga
-0.63
sav
-0.63
MSN
-0.62
POSITIVE LOGITS
accordingly
1.04
thereafter
0.90
alike
0.72
consequently
0.70
consequ
0.67
advoc
0.66
notor
0.66
reused
0.66
afterwards
0.65
thereof
0.64
Activations Density 0.808%