INDEX
Explanations
phrases referencing or mentioning something or someone
instances of the word "referring" and similar phrases that indicate attribution or citation of information
Negative Logits
foothold    -0.69
hold        -0.68
tes         -0.66
tmp         -0.65
mega        -0.64
»Ĵ          -0.62
rieve       -0.61
sburgh      -0.60
eu          -0.60
lineback    -0.60
Positive Logits
sarcast     0.89
referring   0.88
sarc        0.85
Nib         0.75
Jub         0.73
quoting     0.71
joking      0.69
insult      0.69
ento        0.68
homophobic  0.68
Activations Density 0.056%