INDEX
Explanations
references to a prestigious hall of fame and its associated recognition or honors
New Auto-Interp
Negative Logits
myſelf
-1.01
pleaſure
-0.89
ſeveral
-0.89
Anſ
-0.88
iſt
-0.86
themſelves
-0.85
uſed
-0.84
Diſ
-0.83
uſ
-0.83
doubtnut
-0.83
POSITIVE LOGITS
hall
2.12
Hall
2.02
halls
1.91
HALL
1.85
hall
1.79
Hall
1.78
Halls
1.76
HALL
1.69
hallway
1.24
зал
1.06
Activations Density 0.034%