INDEX
Explanations
words related to physical appearance and actions
instances of commas in varied contexts
New Auto-Interp
Negative Logits
¬¼
-0.50
iple
-0.46
izon
-0.44
odes
-0.42
ety
-0.42
orn
-0.39
ahu
-0.38
ocl
-0.38
rou
-0.37
cel
-0.37
POSITIVE LOGITS
albeit
0.86
namely
0.81
although
0.75
respectively
0.74
etc
0.70
however
0.68
though
0.68
whereas
0.67
including
0.64
aka
0.62
Activations Density 1.174%