INDEX
Negative Logits
ãĤ¤ãĥĪ
-0.72
aroo
-0.69
vertisement
-0.69
é¾
-0.68
»Ĵ
-0.68
HAHAHAHA
-0.65
Palest
-0.65
clicked
-0.62
Pigs
-0.61
ij士
-0.61
POSITIVE LOGITS
eper
0.74
arth
0.74
igham
0.74
aults
0.74
Finn
0.72
Marie
0.71
ĸļ
0.70
ucker
0.69
abo
0.68
ecast
0.67
Activations Density 12.940%