SonarQube API sync: performance and default configuration improvements #14732

@sylvain-combe-sonarsource

Description

Summary

The current SonarQube API sync has a few design choices that could become problematic as the number of projects and findings grows. This issue proposes improvements in two areas: sync efficiency and default import scope.


1. Sync efficiency

Current behavior

The periodic sync task update_findings_from_source_issues (running every 3 hours) currently works as follows:

findings = Finding.objects.filter(sonarqube_issue__isnull=False, active=True)  # entire table, all at once
for finding in findings:
    client, _ = SonarQubeApiImporter.prepare_client(finding.test)  # new client + DB queries per finding
    issue = client.get_issue(finding.sonarqube_issue.key)          # 1 HTTP request per finding

With 10,000 synced findings across many projects, this results in:

  • 10,000 sequential HTTP requests to SonarQube — one per finding, no batching

  • A new requests.Session and several DB queries per finding to reconstruct the client

  • All matching findings loaded into memory at once (no iterator() or chunking)

  • Everything in a single Celery task — no fan-out, no parallelism

  • No try/except per finding: one network error aborts the entire run for all remaining findings

Suggested improvements

a) Batch issue key lookups

GET /api/issues/search accepts a comma-separated issues parameter. Findings could be grouped by SonarQube instance and project, then fetched in batches of up to 500 keys per request — reducing 10,000 HTTP calls to ~20.

Note: GET /api/issues/pull also exists and supports a changedSince timestamp for incremental fetches, but it is marked as an internal endpoint and requires both projectKey and branchName as mandatory parameters. Given that DefectDojo tracks branch at the Test level rather than per-finding, this endpoint is not suitable for the sync use case as it stands.
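A minimal sketch of the batching step (helper names are hypothetical; 500 is SonarQube's page-size cap for issues/search):

```python
from itertools import islice

MAX_KEYS_PER_REQUEST = 500  # page-size cap for GET /api/issues/search


def chunked(iterable, size):
    """Yield successive lists of at most `size` items."""
    it = iter(iterable)
    while batch := list(islice(it, size)):
        yield batch


def batch_issue_params(issue_keys, batch_size=MAX_KEYS_PER_REQUEST):
    """Build one issues/search query dict per batch of keys.

    10,000 keys become ~20 parameter sets (one HTTP request each)
    instead of 10,000 individual per-issue lookups.
    """
    return [
        {"issues": ",".join(batch), "ps": batch_size}
        for batch in chunked(issue_keys, batch_size)
    ]
```

Each parameter set would then be passed to a single `GET /api/issues/search` call against the instance that owns those keys.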

b) Group by project and reuse the client

Construct the client once per project (rather than once per finding) to eliminate redundant DB queries and HTTP session overhead within each sync run.
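A sketch of the grouping step (assuming findings expose the same `test` attribute used in the current loop; `prepare_client` usage stays as in the existing code):

```python
from collections import defaultdict


def group_findings_by_test(findings):
    """Group findings by their parent Test, so one client serves each group.

    The client is then built once per group instead of once per finding:

        for test, group in group_findings_by_test(findings).items():
            client, _ = SonarQubeApiImporter.prepare_client(test)
            # ... sync every finding in `group` with this client
    """
    groups = defaultdict(list)
    for finding in findings:
        groups[finding.test].append(finding)
    return dict(groups)
```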

c) Use Finding.objects.iterator()

Use Django’s iterator() to avoid loading the entire result set into memory when syncing large numbers of findings.
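The change is a one-liner on the existing queryset (chunk_size shown for illustration; Django defaults to 2000):

```python
findings = Finding.objects.filter(
    sonarqube_issue__isnull=False, active=True,
).iterator(chunk_size=500)  # stream rows in chunks instead of loading all at once
```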

d) Fan out to per-project Celery subtasks

Fan out into per-project Celery subtasks so that:

  • A slow or unreachable SonarQube instance no longer blocks syncs for unrelated projects

  • Each project's sync can be retried independently
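A hedged sketch of the fan-out (task names and wiring are illustrative, not the current DefectDojo task layout; `group` is Celery's standard fan-out primitive):

```python
from celery import group, shared_task

@shared_task
def update_findings_from_source_issues():
    product_ids = ...  # distinct products that have synced SonarQube findings
    group(sync_project_findings.s(pid) for pid in product_ids).apply_async()

@shared_task(bind=True, max_retries=3)
def sync_project_findings(self, product_id):
    ...  # build one client, batch-fetch issues, update this project's findings
```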

e) Wrap each finding's update in try/except

Wrap each finding update in try/except so that:

  • One transient network error does not abort the entire 3‑hour sync task

  • The run can continue and log failures individually
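The loop structure could look like this (a sketch; `update_one` stands in for the per-finding SonarQube update):

```python
import logging

logger = logging.getLogger(__name__)


def sync_findings(findings, update_one):
    """Apply `update_one` to each finding, isolating per-finding failures.

    Any exception is logged and counted instead of aborting the run,
    so one transient network error no longer stops the whole task.
    """
    updated, failed = 0, 0
    for finding in findings:
        try:
            update_one(finding)
            updated += 1
        except Exception:
            failed += 1
            logger.exception("Failed to sync finding %s", finding)
    return updated, failed
```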

f) Consider SonarQube webhooks as a complement

SonarQube Server supports outbound webhooks that fire after each analysis. These could trigger a targeted re-import in DefectDojo immediately after a scan completes. With webhooks handling the freshness concern, the full periodic sync could be reduced from every 3 hours to once per day — serving only as a safety net for manual SQ status changes or missed webhook deliveries.
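On the receiving side, the first step would be identifying the project from the payload. A minimal sketch (the DefectDojo endpoint wiring is out of scope here; `status` and `project.key` are part of SonarQube's documented webhook payload):

```python
import json


def project_key_from_webhook(body):
    """Extract the project key from a SonarQube webhook payload.

    SonarQube POSTs a JSON body after each analysis; `project.key`
    identifies which product needs a targeted re-import. Returns None
    for payloads that are not successfully completed analyses.
    """
    payload = json.loads(body)
    if payload.get("status") != "SUCCESS":
        return None
    return payload.get("project", {}).get("key")
```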


2. Default import scope includes non-security issue types

Current behavior

When a SonarQube API import is configured without specifying the Extras field, DefectDojo imports all issue types:

  • BUG

  • VULNERABILITY

  • CODE_SMELL

  • SECURITY_HOTSPOT

Given that DefectDojo is a security-focused tool, importing BUG and CODE_SMELL findings:

  • Inflates finding counts and pollutes security dashboards

  • Skews metrics such as MTTR and severity distribution

  • Adds noise to deduplication logic

Suggested change

Change the default fallback to import only security-relevant types:

VULNERABILITY,SECURITY_HOTSPOT

Users who need BUG and CODE_SMELL can opt in explicitly via the Extras field. A short note in the documentation and tool configuration UI would make this opt-in visible.
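The fallback itself is small; a sketch of the resolution logic (function and constant names hypothetical):

```python
DEFAULT_ISSUE_TYPES = "VULNERABILITY,SECURITY_HOTSPOT"


def issue_types(extras):
    """Resolve the issue-type filter for an import.

    Uses the Extras field when provided; otherwise falls back to
    security-relevant types only.
    """
    value = extras or DEFAULT_ISSUE_TYPES
    return [t.strip() for t in value.split(",") if t.strip()]
```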


Impact summary

Default to VULNERABILITY,SECURITY_HOTSPOT
Effort: Low — Impact: High (cleaner data for all new integrations).

try/except per finding in sync loop
Effort: Low — Impact: Medium (prevents full-task aborts).

iterator() on findings queryset
Effort: Low — Impact: Medium (reduces memory pressure).

Batch issue key lookups (up to 500/request)
Effort: Medium — Impact: High (eliminates N+1 HTTP calls).

Group by project, reuse client
Effort: Medium — Impact: High (eliminates N+1 client construction).

Per-project Celery fan-out
Effort: Medium — Impact: Medium (improves isolation and throughput).

Webhook-triggered import
Effort: High — Impact: High (near-realtime sync, reduces polling load).


I'll be happy to help potential contributors as a Sonar expert.
