Explanations
Roman numerals IX (9) and X (10)
Negative Logits
--------------------------------------------------------  -0.71
rule  -0.65
clutch  -0.63
TPS  -0.62
mount  -0.58
fixes  -0.58
thritis  -0.58
xon  -0.58
Serious  -0.57
forth  -0.56
Positive Logits
iew  1.10
irus  1.09
isions  1.09
olution  1.06
isible  1.03
entric  1.00
ision  1.00
itamin  0.97
iral  0.97
orst  0.97
Activations Density 0.030%