INDEX
Explanations
numeric ratings in the form of stars
references to star ratings in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.90
£ı
-0.89
Canaver
-0.89
odcast
-0.86
pse
-0.77
ptives
-0.75
nown
-0.73
apons
-0.70
hod
-0.69
sym
-0.69
POSITIVE LOGITS
vation
0.99
rating
0.93
rated
0.84
ved
0.80
Rated
0.78
ratings
0.76
rating
0.75
ochet
0.72
bucks
0.72
star
0.72
Activations Density 0.013%