INDEX
Explanations
instances where a condition is met or a potential consequence is mentioned
conditional phrases indicating potential situations or events
New Auto-Interp
Negative Logits
ļéĨĴ
-0.75
oked
-0.74
leground
-0.73
ãĤ¨ãĥ«
-0.71
nai
-0.70
zeb
-0.69
arse
-0.69
apsed
-0.68
alli
-0.68
resent
-0.68
POSITIVE LOGITS
then
1.03
chances
0.97
it
0.95
then
0.90
surely
0.83
however
0.83
expect
0.83
they
0.79
hopefully
0.76
perhaps
0.74
Activations Density 0.179%