INDEX
Explanations
truthful assertions and statements related to verification
New Auto-Interp
Negative Logits
electr
-0.81
cules
-0.71
senal
-0.69
slightest
-0.68
scrut
-0.68
nomine
-0.68
yip
-0.67
handling
-0.62
coerc
-0.58
strategically
-0.58
POSITIVE LOGITS
Zone
0.79
tenance
0.71
},"
0.71
-|
0.70
},{"0.67
partName
0.64
Chimera
0.63
zone
0.63
:{0.63
atorium
0.63
Activations Density 0.028%