INDEX
Explanations
phrases indicating causation or results
New Auto-Interp
Negative Logits
'{@-0.67
batalha
-0.67
entgegen
-0.60
escuchadas
-0.60
casó
-0.60
voraus
-0.58
strå
-0.57
styleable
-0.57
fluence
-0.57
enfans
-0.55
POSITIVE LOGITS
resulted
0.90
caused
0.80
resulting
0.80
ToAction
0.80
increased
0.79
eventual
0.78
导致
0.77
causes
0.76
causing
0.74
caused
0.72
Activations Density 0.347%