INDEX
Explanations
phrases that express causation or reasons
Immediately precedes "the" and follows "to" or "of"
reasons for consequences
New Auto-Interp
Negative Logits
uxxxx
-0.57
Athenians
-0.57
Allegretto
-0.57
Agamemnon
-0.53
itſelf
-0.51
idéia
-0.51
persino
-0.50
sendiri
-0.50
myſelf
-0.49
་་
-0.49
POSITIVE LOGITS
lack
0.90
reasons
0.85
its
0.81
their
0.75
reasons
0.73
limitations
0.72
adanya
0.71
being
0.71
necessity
0.70
differences
0.69
Activations Density 0.206%