INDEX
Explanations
verbs related to identification or detection
Negative Logits
orld         -0.79
fare         -0.79
ework        -0.72
icum         -0.69
acquitted    -0.65
steel        -0.65
obar         -0.63
reeling      -0.63
hetti        -0.63
stead        -0.62
Positive Logits
weaknesses         0.84
flaws              0.82
pointers           0.76
ively              0.75
landmarks          0.74
shortcomings       0.73
gaps               0.72
deficiencies       0.71
vulnerabilities    0.70
criteria           0.69
Activations Density 0.042%