INDEX
Explanations
mentions of frequency or repetition
the word "each" to highlight instances of collective or individual consideration
New Auto-Interp
Negative Logits
hin
-0.78
dad
-0.64
soType
-0.60
Sport
-0.60
news
-0.57
ensen
-0.57
Prediction
-0.56
audi
-0.56
ota
-0.54
sic
-0.54
POSITIVE LOGITS
each
3.34
each
2.67
Each
2.27
Each
2.26
apiece
1.97
every
1.71
every
1.23
annually
1.21
respectively
1.19
EVERY
1.16
Activations Density 0.032%