INDEX
Explanations
purpose or audience of text
Negative Logits
showcasing      0.44
powierzchni     0.42
ProfileAction   0.42
showing         0.41
narrowly        0.40
looping         0.39
伷              0.39
wide            0.39
foregoing       0.38
stylist         0.38
Positive Logits
impairs         0.49
опас            0.44
->              0.44
enfermedades    0.44
、              0.44
BENEF           0.44
危害            0.44
可能            0.43
ផល              0.43
Factors         0.42
Activations Density: 0.001%