INDEX
Explanations
mentions of historical events or figures
sequences of numerical values or ratings associated with content
New Auto-Interp
Negative Logits
©¶æ
-0.89
ulkan
-0.75
nesday
-0.73
wagen
-0.71
ertodd
-0.69
iga
-0.65
overth
-0.65
partisans
-0.64
<[
-0.64
utsche
-0.64
POSITIVE LOGITS
SPONSORED
1.00
PHOTOS
0.95
Advertisement
0.82
AUT
0.82
Writing
0.82
é¾
0.82
Scroll
0.80
Age
0.80
Known
0.77
Their
0.76
Activations Density 0.694%