Explanations
phrases indicating events or actions happening before and after something
phrases that indicate comparisons of events or conditions occurring before and after a specific point in time
Negative Logits
Journals    -0.61
gur         -0.60
agy         -0.60
illi        -0.60
Petr        -0.60
WARE        -0.59
Kard        -0.58
Fram        -0.58
ADRA        -0.58
Merit       -0.57
POSITIVE LOGITS
aft         0.97
during      0.89
behind      0.78
rogen       0.78
during      0.77
ecycle      0.76
rogens      0.76
isode       0.73
present     0.73
AFTER       0.71
Activations Density 0.052%