INDEX
Explanations
true statements leading to contradictions
Negative Logits
потенциа  0.47
φέ  0.46
이  0.45
element  0.44
ája  0.44
𝖗  0.44
का  0.44
Kald  0.43
готовка  0.42
오  0.42
Positive Logits
inyin  0.48
spiritual  0.47
Spiritual  0.44
pgamma  0.44
sphing  0.44
preached  0.43
bilirubin  0.43
prophes  0.43
موسیقی  0.43
datatables  0.42
Activations Density: 0.006%