INDEX
Explanations
instances where something is being reported or stated
phrases indicating the presence or occurrence of events
New Auto-Interp
Negative Logits
ocol
-0.66
anding
-0.62
osa
-0.60
catentry
-0.59
WARNING
-0.59
forward
-0.58
eem
-0.56
proclaiming
-0.56
DMV
-0.55
vt
-0.54
POSITIVE LOGITS
been
1.21
been
1.05
Been
1.03
gotten
1.00
undergone
0.92
gotten
0.92
ĸļ
0.84
rils
0.83
arisen
0.78
come
0.76
Activations Density 0.112%