INDEX
Explanations
words/phrases ending with 'h'
occurrences of the letter "h" in the text
New Auto-Interp
Negative Logits
Tyrann
-0.66
mammoth
-0.66
tremend
-0.63
tyrann
-0.60
terminal
-0.59
Tonight
-0.59
princ
-0.58
accountable
-0.58
congr
-0.57
ente
-0.56
POSITIVE LOGITS
irts
1.36
adow
1.18
ooters
1.16
ooter
1.08
ield
1.08
IFT
1.02
oulder
0.99
ippers
0.98
owers
0.98
apers
0.95
Activations Density 0.030%