INDEX
Explanations
proper nouns
references to specific individuals, particularly the name "Stern."
New Auto-Interp
Negative Logits
teen
-0.87
orate
-0.77
20439
-0.75
yrs
-0.70
OPS
-0.68
Dub
-0.67
tesy
-0.67
[|
-0.66
NCT
-0.66
xxxxxxxx
-0.65
POSITIVE LOGITS
Stern
1.19
bach
0.98
sonian
0.95
stein
0.94
berg
0.94
Eisen
0.93
beck
0.87
Berman
0.79
alties
0.75
itzer
0.74
Activations Density 0.008%