INDEX
Explanations
phrases related to the term "Big"
mentions of significant entities or groups
Negative Logits
confir      -0.96
theless     -0.83
idency      -0.79
istry       -0.74
ILA         -0.69
anwhile     -0.69
livest      -0.69
autop       -0.67
lawfully    -0.67
Dialogue    -0.67
Positive Logits
gest        1.41
ger         1.26
gie         0.97
gers        0.93
glers       0.92
wig         0.90
gins        0.90
GER         0.87
ging        0.86
Integer     0.84
Activations Density: 0.021%
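
One way to sanity-check the "Big" explanation against the positive logits is to append each promoted token to the prefix "Big" and see which results look like words. The sketch below does only that. The token strings and values are copied from the listing above; the prefix, the variable names, and the idea that the feature promotes continuations of "Big" are assumptions made for illustration, not something the page states.

```python
# Top positive-logit tokens and their values, copied from the listing above.
positive_logits = [
    ("gest", 1.41), ("ger", 1.26), ("gie", 0.97), ("gers", 0.93),
    ("glers", 0.92), ("wig", 0.90), ("gins", 0.90), ("GER", 0.87),
    ("ging", 0.86), ("Integer", 0.84),
]

# Assumption: the feature fires on "Big" and boosts tokens that continue it.
PREFIX = "Big"

# Concatenate the prefix with every promoted token, keeping the logit value.
completions = [(PREFIX + token, value) for token, value in positive_logits]

for word, value in completions:
    print(f"{word:<12}{value:.2f}")
```

Read this way, most of the promoted tokens complete a recognizable word or name ("Biggest", "Bigger", "Biggie", "Bigwig"), while "Integer" does not fit the pattern. This is a reading aid only and does not measure what the feature actually does inside the model.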