INDEX
Explanations
instances where something is being checked or examined
repeated mentions of the word "checked."
New Auto-Interp
Negative Logits
thora
-0.69
iens
-0.66
ennes
-0.64
parallels
-0.62
nery
-0.62
olution
-0.59
iasm
-0.59
DEM
-0.58
reement
-0.58
Lever
-0.57
POSITIVE LOGITS
checked
3.63
checked
2.61
checking
1.87
check
1.83
checks
1.81
inspected
1.80
checking
1.66
check
1.60
checks
1.56
tested
1.55
Activations Density 0.009%