INDEX
Explanations
instances of the word "detection" and its variants
Negative Logits
Token       Logit
i           -0.83
addFlags    -0.78
whole       -0.72
ي           -0.68
Mindy       -0.67
Winder      -0.64
li          -0.64
zele        -0.64
Thore       -0.64
oro         -0.63
Positive Logits
Token       Logit
DETECTION   1.12
Detect      1.07
detectors   1.04
DETECT      1.03
detections  1.02
Detectors   1.02
DETECT      1.02
pośred      1.00
Dete        1.00
Detected    0.98
Activations Density 0.187%