Explanations
mentions of the musician Kanye West
Negative Logits
Token       Logit
ioned       -0.87
cially      -0.86
apy         -0.82
代          -0.81
elly        -0.81
perature    -0.80
ional       -0.77
phrine      -0.75
mond        -0.72
alties      -0.69
Positive Logits
Token       Logit
regate      0.84
reth        0.81
ofer        0.76
sta         0.74
Commerce    0.73
enstein     0.71
units       0.71
lund        0.70
rats        0.69
adding      0.68
Activations Density: 0.058%