INDEX
Explanations
phrases indicating an overall summary or conclusion
the phrase "in all" and its variations, indicating a focus on inclusivity or totality
New Auto-Interp
Negative Logits
fell
-0.70
arthed
-0.67
eday
-0.67
stadt
-0.63
Citation
-0.63
bryce
-0.62
gaard
-0.62
raction
-0.61
sworth
-0.61
ALSE
-0.59
POSITIVE LOGITS
clusive
1.11
CLUS
0.69
sudden
0.67
oots
0.67
together
0.65
toget
0.64
ighter
0.63
together
0.63
patient
0.61
ooting
0.61
Activations Density 0.060%