INDEX
Explanations
dates or time related information
New Auto-Interp
Negative Logits
ertain
-0.92
isible
-0.83
tick
-0.80
xx
-0.79
eg
-0.79
psy
-0.77
lez
-0.76
aez
-0.74
Duration
-0.73
Grade
-0.72
POSITIVE LOGITS
refusing
0.83
allegations
0.83
failing
0.82
admitting
0.82
discovering
0.81
authorities
0.80
unsuccessfully
0.80
mistakenly
0.80
they
0.80
undergoing
0.78
Activations Density 0.889%