INDEX
Explanations
terms related to concepts of power, opportunity, and social structure
New Auto-Interp
Negative Logits
including
-0.45
บ้าง
-0.42
including
-0.41
таратура
-0.40
באופן
-0.40
ipedi
-0.39
Including
-0.37
включая
-0.36
cintura
-0.36
klärt
-0.36
POSITIVE LOGITS
unto
0.96
worth
0.81
worthy
0.75
deserving
0.67
tagHelperRunner
0.65
waiting
0.63
indeed
0.62
bagi
0.59
akin
0.59
geworden
0.58
Activations Density 0.627%