INDEX
Explanations
references to the letter "Y"
the letter "Y" and its repetitions in various contexts
New Auto-Interp
Negative Logits
ãĥ¼ãĥĨãĤ£
-0.91
ãĥ¼ãĥĨ
-0.77
icable
-0.73
Cortana
-0.71
entimes
-0.70
idity
-0.67
neapolis
-0.67
ilogy
-0.67
Mehran
-0.64
"$:/
-0.63
POSITIVE LOGITS
ield
0.97
aku
0.95
UV
0.95
ank
0.92
ORK
0.92
ahoo
0.91
ANK
0.91
LD
0.91
anks
0.90
urt
0.89
Activations Density 0.026%