INDEX
Explanations
phrases related to causation or prediction of outcomes
phrases indicating causality or outcomes
New Auto-Interp
Negative Logits
oqu
-0.65
yd
-0.63
pmwiki
-0.62
Cas
-0.59
advertisement
-0.59
kus
-0.59
ascus
-0.59
Unknown
-0.57
Bastard
-0.57
Pic
-0.57
POSITIVE LOGITS
someday
0.86
enance
0.81
geries
0.81
igate
0.78
tomorrow
0.72
gery
0.71
rued
0.69
lessly
0.67
some
0.67
sooner
0.66
Activations Density 0.321%