Explanations
official statements and commitments
Negative Logits
révol       0.86
发现        0.80
discovers   0.78
發現        0.74
discover    0.72
tespit      0.69
detects     0.69
finden      0.68
обнаружи    0.68
ๅ           0.68
Positive Logits
reiterated  1.56
vowed       1.48
pledged     1.46
pledge      1.43
pledges     1.40
vows        1.39
vow         1.39
reaffirmed  1.35
anunció     1.34
promised    1.33
Activations Density 0.139%