INDEX
Negative Logits
avises        0.51
wohner        0.50
फारिश         0.49
patham        0.48
ಿಯೇ           0.48
ăpadă         0.47
orientated    0.47
anzit         0.47
ভৃতি          0.46
গ্            0.46
Positive Logits
Unle             0.59
Understanding    0.57
Understanding    0.57
Secrets          0.55
Principles       0.53
captivating      0.52
Discover         0.52
Explained        0.52
Beyond           0.51
0.51
Activations Density 0.000%