INDEX
Negative Logits
slapped
0.80
repeated
0.78
repetition
0.77
malware
0.74
workload
0.73
Malware
0.72
cohesion
0.71
Reverse
0.71
offender
0.70
vulnerability
0.70
POSITIVE LOGITS
usepackage
1.11
import
0.80
Pow
0.79
intro
0.77
algèbre
0.76
space
0.76
Intro
0.76
oldsymbol
0.75
Need
0.74
umenical
0.73
Activations Density 0.000%