INDEX
Explanations
mentions of actions related to authority or exclusive rights
occurrences of the substring "rog" in various contexts
Negative Logits
Token      Logit
birth      -0.70
aved       -0.68
goodbye    -0.65
Fas        -0.64
ICAN       -0.57
rix        -0.57
Machina    -0.56
INESS      -0.56
sever      -0.56
fig        -0.55
Positive Logits
Token      Logit
raphic     1.32
raphics    1.22
raph       1.20
allery     1.07
roup       1.06
aming      0.99
rams       0.95
atory      0.95
roups      0.93
ues        0.92
Activations Density: 0.045%