INDEX
Explanations
personal names
proper nouns, specifically names of individuals and places
New Auto-Interp
Negative Logits
âĶĢâĶĢ
-0.71
Nex
-0.68
daytime
-0.68
schedule
-0.63
profit
-0.63
alt
-0.62
heel
-0.61
equivalent
-0.60
increment
-0.59
volume
-0.59
POSITIVE LOGITS
baugh
1.33
ingham
1.21
love
1.20
shall
1.16
heimer
1.12
idge
1.11
alter
1.09
ley
1.07
enberg
1.07
inski
1.06
Activations Density 0.263%