INDEX
Explanations
phrases indicating attempts or efforts to achieve something
New Auto-Interp
Negative Logits
PLWABN
-0.51
preocupação
-0.51
possibilidade
-0.49
autorytatywna
-0.48
exercí
-0.47
experimented
-0.47
initComponents
-0.47
experiment
-0.47
ungkinan
-0.46
überprü
-0.46
POSITIVE LOGITS
convince
0.91
persuade
0.84
convincing
0.77
persuading
0.75
coax
0.69
persuaded
0.68
convinced
0.67
locate
0.66
somehow
0.66
Somehow
0.66
Activations Density 0.640%