Negative Logits
reliable       0.87
sincerity      0.85
દો             0.80
Allard         0.78
reliability    0.77
sincere        0.71
convergence    0.71
수준           0.71
assurances     0.71
முழுக்க        0.70
Positive Logits
Easy           1.11
easier         1.05
Eas            1.05
easy           1.03
Easy           1.03
Easier         1.02
fácilmente     0.97
facilidad      0.97
便于           0.94
gemakkelijk    0.91
Activations Density: 0.280%