INDEX
Explanations
words related to proper nouns and locations
words related to specific names or titles, particularly focusing on the prefix 'B' and similar patterns
New Auto-Interp
Negative Logits
FORMATION
-0.69
\/\/
-0.67
RFC
-0.65
Malays
-0.60
FTC
-0.60
xual
-0.59
ĸļ
-0.58
Whereas
-0.57
ources
-0.57
ngth
-0.57
POSITIVE LOGITS
levard
1.11
lehem
1.01
pillar
0.99
apest
0.94
hammad
0.88
rill
0.80
etooth
0.79
abase
0.78
ause
0.78
aneers
0.77
Activations Density 0.126%