INDEX
Explanations
"have" or "has" followed by an outcome
Negative Logits
Token        Value
Hence        0.78
With         0.62
Hence        0.59
Response     0.57
excluding    0.56
ين           0.56
except       0.56
Worth        0.56
Color        0.55
Sometimes    0.55
Positive Logits
Token        Value
been         1.71
gotten       1.24
BEEN         1.11
got          1.09
begun        1.08
been         1.07
arisen       1.04
fått         0.98
undergone    0.93
venido       0.90
Activations Density: 0.247%