INDEX
Explanations
decimal points or version numbers in software updates
New Auto-Interp
Negative Logits
affili
-0.76
deliber
-0.75
mislead
-0.70
embell
-0.66
indo
-0.66
thr
-0.65
offic
-0.64
boycot
-0.64
deduct
-0.64
fallacy
-0.63
POSITIVE LOGITS
dylib
1.15
0
0.99
jar
0.94
gz
0.94
jpg
0.93
7601
0.92
840
0.90
1
0.88
zip
0.88
compatible
0.84
Activations Density 0.020%