Explanations
instances where individuals express thoughts or feelings about a particular topic
phrases expressing possession or ownership
Negative Logits
Token        Logit
Vog          -0.73
unker        -0.69
Carbuncle    -0.68
Lyndon       -0.68
Hutch        -0.68
haus         -0.65
Lerner       -0.65
Rowling      -0.65
Osc          -0.65
Rudolph      -0.63
Positive Logits
Token        Logit
own          1.57
self         1.07
condolences  1.03
selves       0.96
Own          0.95
favourite    0.95
thoughts     0.93
displeasure  0.92
preferred    0.90
resignation  0.89
Activations Density: 0.219%