INDEX
Explanations
chemical elements and compounds
phrases and terms related to reasoning and logic
New Auto-Interp
Negative Logits
mable
-0.79
eworld
-0.73
ORN
-0.72
rush
-0.71
MN
-0.71
snap
-0.70
TED
-0.70
ought
-0.70
Spread
-0.69
OVER
-0.68
POSITIVE LOGITS
Gentleman
0.80
sophistic
0.80
bourgeoisie
0.76
dilig
0.73
Volks
0.69
bou
0.69
gentlemen
0.68
bourg
0.68
du
0.68
Ãł
0.67
Activations Density 0.324%