Explanations
attributes or states of being
Negative Logits
avacanam    0.18
হইবার    0.18
nhưng    0.17
mogao    0.17
是一種    0.17
কিন্তু    0.16
vacanam    0.16
podendo    0.16
conseguido    0.16
причем    0.16
Positive Logits
whose    0.21
actively    0.21
involved    0.20
currently    0.20
heavily    0.20
deemed    0.20
consistently    0.20
truly    0.19
responsible    0.18
poised    0.18
Activations Density 0.181%