INDEX
Explanations
occurrences of specific numbers and punctuation
New Auto-Interp
Negative Logits
inement
-0.73
pursu
-0.72
transgress
-0.66
æ©
-0.65
targeted
-0.64
skirm
-0.64
overl
-0.64
aband
-0.63
equivalents
-0.63
arous
-0.63
POSITIVE LOGITS
Yeah
1.30
Alright
1.12
Okay
1.01
Exactly
1.01
Originally
0.99
Absolutely
0.98
Yeah
0.97
Hmm
0.94
Hey
0.93
Firstly
0.93
Activations Density 0.050%