INDEX
Explanations
words related to support or endorsement
references to support or endorsement
New Auto-Interp
Negative Logits
tein
-0.83
awk
-0.72
RT
-0.69
odor
-0.68
kel
-0.66
ppa
-0.66
js
-0.66
ogens
-0.66
NT
-0.65
Hebdo
-0.65
POSITIVE LOGITS
backing
1.16
backed
0.80
backed
0.75
vocals
0.75
swing
0.73
steen
0.71
track
0.71
backer
0.69
footing
0.68
Jindal
0.68
Activations Density 0.004%