Explanations
phrases indicating repetition or frequency
mentions of repeated actions or occurrences
Negative Logits
Reviewer    -0.88
XT          -0.84
ains        -0.77
Marginal    -0.76
aceous      -0.74
ettle       -0.74
ourt        -0.73
rats        -0.73
agra        -0.72
rights      -0.72
Positive Logits
consecut    0.97
cale        0.86
throughout  0.78
fold        0.75
points      0.74
manship     0.72
coded       0.70
soever      0.66
before      0.65
apiece      0.64
Activations Density 0.036%