INDEX
Explanations
specific entities or names containing the letters "NR" followed by a digit
mentions of "NR" followed by numbers, indicating a focus on specific ratings or identifiers
New Auto-Interp
Negative Logits
congr
-0.77
Metatron
-0.70
Izan
-0.67
undone
-0.65
dates
-0.64
Minecraft
-0.61
heartedly
-0.61
Fairfax
-0.61
forcing
-0.60
Villa
-0.60
POSITIVE LOGITS
NR
0.91
UTH
0.80
ACK
0.79
NR
0.79
otiation
0.77
yss
0.75
OST
0.75
ud
0.74
ativity
0.74
UV
0.73
Activations Density 0.022%