INDEX
Explanations
attends to clauses containing the word "which" from preceding phrases that provide context on various topics
Head Attr Weights
0:0.06
1:0.08
2:0.07
3:0.12
4:0.13
5:0.03
6:0.33
7:0.14
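As a minimal sketch of how the head attribution weights above might be read, the snippet below copies the eight listed values into a dictionary and reports the head with the largest weight alongside the total. The names `head_attr_weights` and `summarize_heads` are hypothetical and introduced only for illustration; treating each value as that head's share of the feature's attribution is an assumption, not something the listing states.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: each value is that head's share of the feature's attribution.
head_attr_weights = {
    0: 0.06, 1: 0.08, 2: 0.07, 3: 0.12,
    4: 0.13, 5: 0.03, 6: 0.33, 7: 0.14,
}


def summarize_heads(weights):
    """Return (top head index, its weight, sum of all listed weights)."""
    total = sum(weights.values())
    top_head = max(weights, key=weights.get)
    return top_head, weights[top_head], total


top_head, top_weight, total = summarize_heads(head_attr_weights)
print(f"head {top_head}: {top_weight:.2f} of {total:.2f} total")
```

With the values listed, head 6 has the largest weight (0.33), and the eight weights sum to 0.96 rather than 1.00; the listing does not say whether the remainder is rounding or unlisted attribution.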
Negative Logits
Token    Logit
acaktır    -0.29
our    -0.27
our    -0.27
Ф    -0.27
ten    -0.26
])));    -0.26
He    -0.26
Our    -0.25
eleste    -0.25
!"    -0.25
Positive Logits
Token    Logit
ⓧ    0.49
незавершена    0.49
autorytatywna    0.48
GEBURTSDATUM    0.48
ValueStyle    0.48
كومونز    0.48
Autoritní    0.45
beginnetje    0.44
WriteTagHelper    0.44
JspWriter    0.44
Activations Density 0.578%