Explanations
mentions of a particular individual named "Kid" with varying activation strengths
references to the term "Kid" as it relates to specific individuals or contexts
Negative Logits
ーティ         -0.78
commission    -0.72
aukee         -0.70
weekday       -0.68
confir        -0.66
merge         -0.65
coalition     -0.64
ministers     -0.63
fuse          -0.63
timestamp     -0.63
Positive Logits
Icar          1.12
neys          1.08
ney           0.98
bean          0.93
Doodle        0.90
amac          0.88
Kid           0.87
sie           0.87
stones        0.85
pad           0.83
Activations Density 0.028%