INDEX
Explanations
phrases related to media and news stories
commas in the text
New Auto-Interp
Negative Logits
¬¼
-0.74
ety
-0.63
vered
-0.62
ocl
-0.62
enary
-0.60
UF
-0.58
iple
-0.58
irie
-0.57
isi
-0.57
uber
-0.57
POSITIVE LOGITS
albeit
1.15
although
1.07
whereas
1.03
namely
0.97
but
0.96
which
0.96
though
0.93
including
0.93
regardless
0.86
whereby
0.85
Activations Density 0.721%