INDEX
Explanations
people's names or terms related to specific individuals
occurrences of the substring "ury"
New Auto-Interp
Negative Logits
olkien
-0.79
eworld
-0.78
chart
-0.78
eared
-0.73
cedented
-0.72
insula
-0.72
Gutenberg
-0.70
mercial
-0.70
gotten
-0.68
hatt
-0.66
POSITIVE LOGITS
ury
1.04
stein
0.87
ous
0.70
uay
0.69
âķIJ
0.68
pants
0.66
Sul
0.65
urs
0.65
ry
0.64
von
0.63
Activations Density 0.006%