Explanations
proper nouns
sequences of repeated tokens or placeholders
Negative Logits
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³    -0.67
choke            -0.66
pockets          -0.61
installation     -0.61
transfers        -0.60
Ö¼               -0.60
installations    -0.59
Malfoy           -0.59
folder           -0.59
redemption       -0.59
Positive Logits
Daily       1.13
aretz       1.12
News        1.11
Politics    1.09
NBC         1.03
Politico    0.99
Journal     0.96
Daily       0.96
daily       0.95
itbart      0.94
Activations Density 0.259%