INDEX
Explanations
names of people and organizations
mentions of notable individuals or names related to entertainment or media
Negative Logits
Token          Logit
Tokens         -0.70
Xi             -0.69
respectively   -0.68
depending      -0.65
cause          -0.63
Flight         -0.62
Plex           -0.60
Compare        -0.60
Fed            -0.60
ADRA           -0.60
Positive Logits
Token          Logit
writes         0.91
Photography    0.91
Profile        0.89
Quote          0.87
Jr             0.87
Interview      0.86
(@             0.85
wrote          0.85
discusses      0.81
aka            0.79
Activations Density 0.293%