INDEX
Explanations
phrases that indicate a cause and effect relationship
New Auto-Interp
Negative Logits
élevées
-0.57
/**
-0.46
больше
-0.45
/
-0.44
jenigen
-0.43
like
-0.42
/*
-0.41
and
-0.40
以下
-0.40
tos
-0.39
POSITIVE LOGITS
result
1.72
result
1.30
consequence
1.29
matter
1.22
resultado
1.17
RESULT
1.12
resultat
1.06
Result
1.05
Resultat
1.01
risultato
0.99
Activations Density 0.188%