Explanations
personal opinions or reflections
personal pronouns and expressions of individual perspective
Negative Logits
Loading        -0.74
Stock          -0.71
Availability   -0.70
Lot            -0.65
Appearances    -0.63
kernels        -0.62
Detailed       -0.62
Spawn          -0.62
Output         -0.60
dism           -0.60
Positive Logits
friends        0.73
coworkers      0.72
colleagues     0.69
å·             0.69
myself         0.68
ACTED          0.66
usalem         0.65
friends        0.65
opic           0.65
mates          0.65
Activations Density 0.425%