INDEX
Explanations
references to a particular acronym "WR" followed by a numerical value
references to the 'WR' designation, likely indicating a specific type of character or item identifier in a context
New Auto-Interp
Negative Logits
anamo
-0.80
Maid
-0.65
uations
-0.63
omorphic
-0.62
onyms
-0.59
Crosby
-0.58
Belg
-0.58
Debor
-0.57
cles
-0.57
ovic
-0.56
POSITIVE LOGITS
ONG
1.18
ACK
1.08
EST
0.96
MJ
0.93
IGHT
0.90
WR
0.88
IVES
0.87
ITT
0.86
ATH
0.86
IGHTS
0.84
Activations Density 0.009%