INDEX
Explanations
names of characters or locations ending in 'ang' or 'ong'
New Auto-Interp
Negative Logits
rees
-0.69
ively
-0.68
ngth
-0.67
ional
-0.65
ortun
-0.62
ibles
-0.60
alion
-0.59
uously
-0.59
iaries
-0.58
ives
-0.58
POSITIVE LOGITS
hound
0.61
lda
0.60
mberg
0.59
Olymp
0.58
wolf
0.58
ford
0.58
gang
0.56
ORPG
0.54
Yang
0.53
PAC
0.53
Activations Density 7.703%