INDEX
Explanations
references to different states or state-related concepts
New Auto-Interp
Negative Logits
Pilgri
-0.80
Piac
-0.80
}),
-0.77
helves
-0.76
fusca
-0.73
Crusaders
-0.73
Jezus
-0.72
Afonso
-0.72
encom
-0.71
Rouen
-0.71
POSITIVE LOGITS
State
1.43
state
1.42
STATE
1.36
States
1.35
states
1.35
states
1.34
States
1.29
state
1.28
STATE
1.25
State
1.20
Activations Density 0.125%