INDEX
Negative Logits
Generally
0.76
xticks
0.73
ियर
0.72
ر
0.71
ಾರ್
0.71
Функция
0.70
Substituting
0.70
Func
0.69
Replacing
0.69
Appending
0.69
POSITIVE LOGITS
exquisite
0.96
terrifying
0.96
horrifying
0.92
situation
0.91
meticulous
0.91
aftermath
0.90
terrible
0.88
sudden
0.87
captivating
0.87
ominous
0.86
Activations Density 0.941%