INDEX
Explanations
information related to historical places, particularly focusing on Chinese history and landmarks
New Auto-Interp
Negative Logits
SERV
-0.99
mathemat
-0.99
PDATE
-0.99
carbohyd
-0.97
psychiat
-0.92
behavi
-0.89
reluct
-0.89
enthusi
-0.88
skelet
-0.86
hemor
-0.86
POSITIVE LOGITS
hai
1.45
ai
1.43
ji
1.33
ja
1.30
wu
1.27
je
1.24
ui
1.21
za
1.18
jin
1.18
gha
1.17
Activations Density 2.800%