INDEX
Explanations
words related to simplification and making things easier
words related to simplification and making complex ideas easier to understand
New Auto-Interp
Negative Logits
affer
-0.71
wine
-0.66
ambassadors
-0.65
eminent
-0.64
rings
-0.63
vine
-0.62
kar
-0.62
CVE
-0.61
tide
-0.60
Member
-0.58
POSITIVE LOGITS
simplicity
0.90
simpl
0.88
simplify
0.84
Catalog
0.82
fusc
0.81
Simpl
0.79
simplified
0.78
ose
0.76
simpler
0.76
ABE
0.74
Activations Density 0.035%