INDEX
Explanations
proper nouns like countries and specific regions
New Auto-Interp
Negative Logits
DragonMagazine
-0.84
ocious
-0.84
utenberg
-0.80
reads
-0.71
transformative
-0.70
ovie
-0.68
otive
-0.68
remem
-0.68
disruptive
-0.68
ewitness
-0.67
POSITIVE LOGITS
Netherlands
1.40
Luxembourg
1.31
Samoa
1.29
Belgium
1.24
France
1.23
Philippines
1.23
Romania
1.21
Switzerland
1.21
Chile
1.21
Denmark
1.21
Activations Density 0.133%