INDEX
Explanations
words involving unique or uncommon characters in the text
repeated phonetic sounds within words
New Auto-Interp
Negative Logits
notation
-0.75
anium
-0.72
doi
-0.69
nexus
-0.67
whiff
-0.66
fork
-0.65
reddits
-0.63
Äĩ
-0.63
Cosponsors
-0.62
raph
-0.61
POSITIVE LOGITS
ãĤ´ãĥ³
0.69
CLASSIFIED
0.62
awei
0.62
Oracle
0.60
influenza
0.60
çͰ
0.59
metic
0.56
çĦ
0.56
Frey
0.55
jong
0.55
Activations Density 0.281%