INDEX
Explanations
mentions of craters or containers
references to craters and crates
Negative Logits
govtrack    -0.79
ilded       -0.73
oga         -0.73
ki          -0.71
ests        -0.70
yo          -0.69
ahon        -0.69
romy        -0.69
ould        -0.69
idy         -0.67
Positive Logits
lain        1.02
士          0.81
Sigma       0.79
Barnett     0.72
ing         0.69
crater      0.69
cake        0.65
berth       0.64
ufact       0.64
zinski      0.62
Activations Density: 0.030%