INDEX
Explanations
words related to tolerance and intolerance
New Auto-Interp
Negative Logits
s
-0.70
ethene
-0.68
-0.66
BeforeClass
-0.65
Higgs
-0.64
-0.63
n
-0.62
$
-0.60
Biggs
-0.60
-0.60
POSITIVE LOGITS
Toler
1.69
tolerance
1.60
Toler
1.57
Tolerance
1.51
tolerances
1.47
tolerant
1.46
toler
1.46
toler
1.36
tolerant
1.34
olerance
1.30
Activations Density 0.010%