INDEX
Explanations
terms related to societal and economic challenges, particularly in the context of political discourse
New Auto-Interp
Negative Logits
ujednoznacz
-0.49
lage
-0.47
host
-0.47
igil
-0.46
Low
-0.45
paired
-0.43
RTLR
-0.43
trud
-0.42
pairs
-0.42
pless
-0.41
POSITIVE LOGITS
сылкі
0.73
PreferredItem
0.72
BagConstraints
0.67
ModelRenderer
0.67
()]);
0.65
Byzantium
0.65
tvguidetime
0.63
istoitu
0.62
يتيمه
0.62
astify
0.61
Activations Density 0.439%