INDEX
Explanations
the letter 'y' followed by other characters in the text
occurrences of the letter 'y'
New Auto-Interp
Negative Logits
ãĥ¯
-0.79
Oracle
-0.79
PowerPoint
-0.73
ãĥ´ãĤ¡
-0.69
tenance
-0.69
sonian
-0.69
iston
-0.68
byter
-0.66
asio
-0.66
usercontent
-0.65
POSITIVE LOGITS
idd
0.88
ummy
0.83
outh
0.82
ield
0.80
dy
0.80
aku
0.78
ahoo
0.77
ouse
0.76
angled
0.75
eros
0.74
Activations Density 0.009%