INDEX
Explanations
crucial, important, vital, unusual
New Auto-Interp
Negative Logits
arbitration
0.44
والث
0.41
concomit
0.40
considerations
0.39
annotations
0.39
whit
0.38
abr
0.38
concomitant
0.38
()?;
0.37
이유는
0.37
POSITIVE LOGITS
having
0.55
этих
0.53
bahwa
0.52
avoir
0.52
こういう
0.51
dieser
0.50
цих
0.50
mundial
0.48
Unternehmen
0.47
melihat
0.46
Activations Density 0.148%