INDEX
Explanations
numerical ratings given to various entities or products
phrases related to ratings or evaluations
New Auto-Interp
Negative Logits
vous
-0.91
ansson
-0.81
Nou
-0.80
alter
-0.78
Alz
-0.77
prus
-0.76
adra
-0.75
adr
-0.72
phys
-0.70
apo
-0.69
POSITIVE LOGITS
rated
1.27
veter
1.04
rating
1.00
ratings
0.99
Ratings
0.96
Rated
0.90
Rating
0.90
Rated
0.86
rating
0.83
Reviewed
0.80
Activations Density 0.010%