INDEX
Explanations
the words "sor" and "gran", with varying activation values
references to different types of sorghum and familial connections
Negative Logits
aments      -0.73
lights      -0.72
Bohem       -0.71
yright      -0.71
Intake      -0.70
Commercial  -0.70
Hague       -0.70
Aviation    -0.70
Lamp        -0.69
Inspector   -0.67
Positive Logits
sor         2.85
gran        1.27
retri       1.11
kin         1.11
EFF         1.09
sher        1.09
infer       1.07
fam         1.06
Tide        1.06
berries     1.02
Activations Density 0.059%