INDEX
Explanations
phrases related to significant events or entities
the term "great" in various contexts
Negative Logits
ople      -0.96
cling     -0.73
ilus      -0.73
endif     -0.71
gging     -0.70
©¶æ       -0.68
PD        -0.67
utical    -0.67
former    -0.66
NG        -0.66
Positive Logits
strides       0.91
sword         0.88
apes          0.79
boon          0.76
itud          0.74
importance    0.70
pains         0.70
proportions   0.70
Dane          0.70
offence       0.69
Activations Density 0.034%