INDEX
Explanations
power, control, industry, knowledge, substance
New Auto-Interp
Negative Logits
دة
0.50
প্রশিক্ষ
0.45
Modo
0.45
सादर
0.45
us
0.44
Aran
0.44
ита
0.44
ни
0.43
Mitar
0.43
waar
0.43
POSITIVE LOGITS
Chines
0.42
industry
0.41
الصنا
0.39
ASSOCI
0.39
cities
0.39
INDUSTRY
0.39
Deputy
0.38
China
0.38
Associate
0.38
unprofitable
0.38
Activations Density 0.002%