INDEX
Explanations
commencement of events
end of turn start
New Auto-Interp
Negative Logits
0.87
'
0.87
ד
0.87
۰
0.87
ascribe
0.82
(
0.81
0.80
Iran
0.77
0.77
EM
0.77
POSITIVE LOGITS
temprana
0.82
is
0.82
ца
0.81
Starts
0.81
Gén
0.78
Quelques
0.78
आती
0.76
to
0.76
Commencez
0.76
starts
0.75
Activations Density 2.937%