INDEX
Explanations
personal information about various individuals, such as birthdates, occupations, and nationality
individuals' names and their associated backgrounds or professions
New Auto-Interp
Negative Logits
CONCLUS
-0.89
ctory
-0.68
Saying
-0.67
reversal
-0.65
inaction
-0.62
dstg
-0.61
insensitive
-0.61
silence
-0.60
forgetting
-0.60
forcement
-0.59
POSITIVE LOGITS
graduated
1.25
specializes
1.19
lived
1.14
grew
1.12
resided
1.11
founded
1.10
embodies
1.09
belonged
1.08
resides
1.07
originated
1.05
Activations Density 0.386%