INDEX
Explanations
check or verify information
New Auto-Interp
Negative Logits
पेन
0.65
aiut
0.63
Chancen
0.63
but
0.63
Öffentlich
0.62
ફી
0.62
renforcer
0.61
ईमान
0.60
murah
0.59
refund
0.59
POSITIVE LOGITS
sebelum
1.08
before
1.07
قبل
0.98
перед
0.97
Before
0.94
Sebelum
0.89
înainte
0.87
Before
0.86
Перед
0.85
before
0.85
Activations Density 0.312%