INDEX
Explanations
references to the name "Peter"
occurrences of the name "Peter."
New Auto-Interp
Negative Logits
actionGroup
-0.80
req
-0.78
shown
-0.74
doors
-0.70
shapeshifter
-0.70
ornia
-0.69
hower
-0.67
2048
-0.67
appropriately
-0.67
ãĥ¼ãĥ³
-0.66
POSITIVE LOGITS
Parker
1.05
bilt
0.99
Peter
0.96
angelo
0.89
Thiel
0.89
Johann
0.88
Gabriel
0.87
Damian
0.86
Berg
0.83
Abram
0.83
Activations Density 0.008%