INDEX
Explanations
descriptions related to people's personal and professional background
sentences that indicate the end of statements or complete thoughts
New Auto-Interp
Negative Logits
consolation
-0.88
tremend
-0.88
optional
-0.76
nodd
-0.76
calf
-0.76
challeng
-0.75
corrid
-0.75
viability
-0.74
feasible
-0.73
exclus
-0.73
POSITIVE LOGITS
Born
1.49
He
1.45
His
1.42
She
1.39
Previously
1.37
Prior
1.32
During
1.32
Originally
1.31
Known
1.30
Serving
1.26
Activations Density 0.255%