INDEX
Explanations
references to balance in various contexts
New Auto-Interp
Negative Logits
ollah
-0.16
²
-0.15
AVA
-0.15
oble
-0.14
ensa
-0.14
lis
-0.14
lus
-0.14
ütün
-0.14
enga
-0.13
idd
-0.13
POSITIVE LOGITS
balance
0.27
balance
0.22
Balance
0.20
(balance
0.19
-bal
0.17
Balance
0.16
andro
0.16
ÑĦоÑĢ
0.15
-priced
0.15
بÙĨدÛĮ
0.15
Activations Density 0.044%