INDEX
Explanations
references to the history of various cultures and civilizations
Negative Logits
anthrop       -0.18
historical    -0.16
convention    -0.15
Anthrop       -0.15
poet          -0.15
467           -0.15
turist        -0.15
tua           -0.14
historian     -0.14
Historical    -0.14
Positive Logits
ninete        0.17
Tw            0.17
medicine      0.17
twentieth     0.17
warfare       0.16
è¿ij          0.16
Judaism       0.16
Architecture  0.16
Medicine      0.15
Warfare       0.15
Activations Density: 0.175%