INDEX
Explanations
phrases with a focus on specific topics or qualities
mentions of focus, performance, and assessment of characters or entities
New Auto-Interp
Negative Logits
wrote
-0.72
elsen
-0.66
äºĶ
-0.65
pired
-0.65
pas
-0.63
©¶æ
-0.63
NPR
-0.62
Lot
-0.62
Lama
-0.61
pires
-0.61
POSITIVE LOGITS
fundamentals
0.91
rather
0.89
aspects
0.89
basics
0.88
strengths
0.83
topics
0.82
themes
0.82
aesthetics
0.80
outcomes
0.79
uality
0.78
Activations Density 0.630%