INDEX
Explanations
specific mentions of ranches
mentions of ranches
Negative Logits
lihood        -0.68
dism          -0.65
ABLE          -0.63
*/(           -0.61
Bok           -0.60
chromos       -0.59
Wong          -0.58
Bucc          -0.57
decom         -0.56
Commonwealth  -0.56
POSITIVE LOGITS
ranch         0.96
ing           0.83
ington        0.82
Bundy         0.81
yard          0.81
agna          0.80
aires         0.77
Ranch         0.76
oise          0.75
naire         0.74
Activations Density: 0.017%