INDEX
Explanations
numerical ratings and stars
New Auto-Interp
Negative Logits
зировать
0.39
своих
0.39
পারব
0.38
නයේ
0.38
ముఖ్య
0.38
أهم
0.37
sécr
0.37
중요
0.37
аргу
0.36
argu
0.35
POSITIVE LOGITS
rating
0.53
Rating
0.52
Rating
0.50
rating
0.48
stars
0.45
puan
0.45
calificación
0.45
awarded
0.45
Rated
0.44
Punkte
0.42
Activations Density 0.038%