Explanations
References to historical contexts, specifically ancient civilizations and their attributes.
Negative Logits
favorably     -0.83
colorless     -0.81
flavorful     -0.81
Favorite      -0.80
Еще           -0.78
favorite      -0.77
theater       -0.77
favorite      -0.76
multicolored  -0.76
neighbors     -0.75
Positive Logits
Australian   1.16
Australia    1.15
             1.07
Australians  1.06
Australian   0.99
AUSTRALIA    0.95
Queensland   0.95
Sydney       0.94
Australia    0.94
Aussie       0.92
Activations Density: 0.284%