INDEX
Explanations
the word "Great" with a high activation strength
references to significant historical events, particularly those denoted by the term "Great."
New Auto-Interp
Negative Logits
quo
-0.73
qqa
-0.63
rett
-0.63
pport
-0.62
lessly
-0.61
mitt
-0.61
pta
-0.60
dictated
-0.60
ded
-0.58
mitted
-0.58
POSITIVE LOGITS
Recession
1.23
Depression
1.09
Lakes
1.04
Barrier
0.98
Plains
0.92
Basin
0.91
Divide
0.91
batch
0.89
Pumpkin
0.86
Dane
0.85
Activations Density 0.022%