Explanations
superior followed by positive outcomes
Negative Logits
Token    Value
ش    1.03
ت    0.95
س    0.87
স    0.84
ق    0.84
ع    0.84
ن    0.82
ج    0.79
ш    0.79
h    0.78
Positive Logits
Token             Value
whistleblower     0.68
kõik              0.63
immobil           0.59
seeker            0.59
thermoelectric    0.59
borrower          0.58
reimbursement     0.58
staunch           0.57
workstation       0.57
cathode           0.56
Activations Density: 0.001%