INDEX
Explanations
expressing hope for positive outcomes
Negative Logits
そらく  0.42
பெரும்பாலும்  0.41
Probably  0.40
おそらく  0.40
cenderung  0.40
prawdopod  0.38
sogenannte  0.38
সাধারণত  0.37
sogenannten  0.37
تقريبا  0.37
Positive Logits
adequately  0.61
bermanfaat  0.60
helpful  0.59
satisfactory  0.59
enough  0.59
sufficiently  0.57
satisfactorily  0.57
genug  0.56
useful  0.55
possa  0.54
Activations Density: 0.054%