INDEX
Explanations
references to prominent individuals or organizations in political contexts
expressions of capability and perceived actions
assigning or giving
New Auto-Interp
Negative Logits
Become
-1.03
Become
-1.01
Becoming
-0.99
become
-0.97
Becoming
-0.96
becoming
-0.96
become
-0.95
becomes
-0.94
Becomes
-0.93
becoming
-0.90
POSITIVE LOGITS
put
0.67
assign
0.65
introduce
0.60
introducing
0.58
oa̍t
0.57
把
0.56
entrust
0.55
treating
0.55
allocate
0.54
把
0.54
Activations Density 2.266%