INDEX
Explanations
details related to a specific object or concept
New Auto-Interp
Negative Logits
irie
-0.78
idy
-0.70
ãĥīãĥ©
-0.66
mx
-0.64
izen
-0.63
enstein
-0.62
gang
-0.62
ļéĨĴ
-0.62
iple
-0.62
orem
-0.61
POSITIVE LOGITS
albeit
1.48
namely
1.38
although
1.38
though
1.28
especially
1.26
however
1.23
but
1.17
except
1.10
especially
1.08
particularly
1.07
Activations Density 1.365%