NEGATIVE LOGITS
mempertahankan  0.39
बढ़ाने  0.38
Preservation  0.38
сохранения  0.37
쓰  0.37
बढ़ाने  0.35
没有任何  0.35
एंप  0.34
추가  0.34
preserves  0.34
POSITIVE LOGITS
relieved  0.55
thankfully  0.55
mitigated  0.54
解消  0.53
rectified  0.52
dissipate  0.51
remedied  0.50
mitigation  0.50
dissipated  0.50
alleviated  0.49
Activation density: 0.088%