INDEX
Explanations
names of notable individuals or significant figures
names of individuals mentioned in the text
New Auto-Interp
Negative Logits
tune
-0.68
USC
-0.65
chem
-0.65
womb
-0.61
Hurricane
-0.61
CTR
-0.59
Spectre
-0.58
NRS
-0.58
Mandatory
-0.58
TBA
-0.57
POSITIVE LOGITS
apologised
1.03
itsch
0.89
yip
0.88
cott
0.87
zinski
0.87
ij士
0.87
opoulos
0.84
iewicz
0.84
arde
0.83
ovich
0.82
Activations Density 0.134%