INDEX
Explanations
references to movie reviews or critiques
New Auto-Interp
Negative Logits
abase
-0.82
SPONSORED
-0.77
href
-0.70
ortium
-0.65
mpeg
-0.64
natureconservancy
-0.63
lehem
-0.61
acca
-0.61
sole
-0.60
ersen
-0.58
POSITIVE LOGITS
kefeller
0.85
Janeiro
0.78
CTR
0.72
backer
0.71
restling
0.66
Berry
0.60
Grac
0.60
runner
0.60
monary
0.60
warts
0.59
Activations Density 3.116%