INDEX
Explanations
numerical ratings associated with performances or events
Negative Logits
illery      -0.08
forward     -0.07
ohana       -0.07
flix        -0.07
Term        -0.07
olulu       -0.07
_palette    -0.07
ammo        -0.06
osaur       -0.06
адж         -0.06
Positive Logits
rol         0.06
91          0.06
pcl         0.06
.exe        0.05
Ger         0.05
0           0.05
oy          0.05
406         0.05
overhead    0.05
theirs      0.05
Activations Density 0.001%