INDEX
Explanations
phrases or words indicating an exception or an alternative
phrases that indicate exceptions or additional information
New Auto-Interp
Negative Logits
plurality
-0.57
rye
-0.56
tumble
-0.53
disinfect
-0.53
machine
-0.52
Meadow
-0.51
shutter
-0.51
deposition
-0.51
Pace
-0.51
ettle
-0.51
POSITIVE LOGITS
heid
1.22
ments
1.07
icularly
0.91
ranging
0.88
selves
0.85
entimes
0.85
ional
0.84
rompt
0.80
ractor
0.80
ively
0.78
Activations Density 0.033%