Explanations
accidental damage and danger
Negative Logits
’        0.48
vier     0.47
itos     0.46
four     0.45
Фа       0.44
zys      0.44
four     0.44
ikiem    0.44
six      0.43
Algebra  0.43
Positive Logits
endangering    0.59
возникновения  0.57
accidentally   0.56
jeopardize     0.56
danos          0.55
contaminate    0.55
导致           0.53
endanger       0.53
irrepar        0.53
accidental     0.52
Activations Density 0.810%