Explanations
sections related to summaries and detailed descriptions of research findings and methodologies
Negative Logits
information  -0.77
results      -0.72
description  -0.69
use          -0.66
analysis     -0.66
explanation  -0.64
knowledge    -0.64
data         -0.63
experience   -0.63
response     -0.63
Positive Logits
Preparation   1.21
Calculation   1.19
Requirements  1.15
Approval      1.13
Results       1.11
Prediction    1.10
Details       1.09
Requirement   1.09
Comparison    1.09
Procedure     1.08
Activations Density: 0.874%