Optimizing Bulk Merge into Azure SQL Hyperscale Without Staging Tables

Question

Optimizing Bulk Merge into Azure SQL Hyperscale Without Staging Tables

Janice Chi 620

JDBC Bulk Update Support

Can the Microsoft JDBC driver handle true bulk updates (not just inserts), or is the recommendation always to use a staging table and in-database MERGE?

  Are there driver settings (e.g., `sendStringParametersAsUnicode`, `packetSize`, `batchSize`) that improve throughput for batched updates/inserts at our scale (3k–70k events/sec)?
  
  **TVP (Table-Valued Parameters) from JDBC**
  
     Is TVP fully supported in the Microsoft JDBC driver for Azure SQL Hyperscale in production workloads?
     
        What are the practical limitations of TVPs (row size, number of rows per TVP, transaction log impact) when we push 5k–20k rows per micro-batch?
        
           Is there any benchmark data comparing TVP throughput vs. staging table MERGE for high-volume streaming?
           
           **Latency & Concurrency**
           
              At high concurrency (multiple Databricks streams writing into Hyperscale), what are the recommended limits for batch size and transaction duration to avoid log contention or throttling?
              
                 How does Hyperscale handle concurrent MERGE operations against the same table if we choose TVP or direct JDBC batch instead of staging?
                 
                 **Deletes & CDC Handling**
                 
                    For deletes (marked in CDC payloads), is it more efficient to batch DELETE statements, or to rely on a TVP MERGE that includes delete flags?
                    
                       Any gotchas with delete-heavy workloads when avoiding staging?
                       
                       **Temporary Tables**
                       
                          If we create a **#temp table inside a stored procedure** to load rows and then MERGE, does Hyperscale optimize this as efficiently as a permanent staging table?
                          
                             Is there any guidance on temp table usage in Hyperscale for high-frequency micro-batch merges?
                             
                             **Bulk Copy API vs. TVP**
                             
                                Bulk Copy (`SQLServerBulkCopy`) is very fast for inserts. Can it be combined with an update/delete pattern in Hyperscale, or is it strictly insert-only?
                                
                                   For insert-only bursts, is Bulk Copy recommended over TVP, and what are the trade-offs?
                                   
                                   **Resiliency & Retries**
                                   
                                      If a TVP or JDBC batch transaction fails mid-micro-batch, what is the best retry pattern: re-send the entire micro-batch, or break into smaller chunks?
                                      
                                         Any Hyperscale-specific features (retry logic, partitioning hints) that can help avoid deadlocks or long rollbacks?
                                         
                                         **Future Roadmap**
                                         
                                            Is Microsoft planning any enhancements to allow direct MERGE INTO from external sources (e.g., Delta/Parquet in ADLS) to Hyperscale, avoiding the need for staging or TVP workarounds?
                                            
                                               Any upcoming features in the Microsoft JDBC driver that improve bulk update/merge performance?

Mahesh Kurva 7,570 Reputation points Microsoft External Staff Moderator

2025-08-25T19:19:01.7933333+00:00
Hi Janice Chi,

Greetings!!

JDBC Bulk Update Support:

The Microsoft JDBC driver can perform bulk operations, but using a staging table is often recommended for bulk updates, especially when you're merging large volumes of data. It helps manage performance and concurrency effectively.

Driver Settings:

Adjusting these settings can help:

sendStringParametersAsUnicode: This can be useful if you're working with Unicode data.

packetSize: Default is usually 8096 bytes; you may test increasing this for larger batches.

batchSize: Setting a suitable batch size can drastically enhance throughput, so consider experimenting to find the sweet spot for your load (3k-70k events/sec).

TVP Support:

The JDBC driver does support Table-Valued Parameters (TVPs), but it's crucial to keep in mind practical limitations:

Row Size: TVPs can generally handle a decent number of rows, but stay below 10,000 rows per TVP to avoid issues.

Transaction Log Impact: The size and number of rows can increase log usage significantly especially when running at high volumes.

Concurrency and Performance:

For high concurrency with multiple streams, consider:

Keep batch sizes smaller to reduce contention.

Test transaction durations to minimize locking.

Monitor lock contention and throughput metrics to dynamically adjust as needed.

Merge Operations:

When using concurrent merges via TVP or Direct JDBC batches:

Azure SQL Hyperscale can handle concurrent operations, but the performance may vary based on locking and transaction isolation levels.

It’s often better to have distinct connections per operation to reduce contention.

Handling Deletes Efficiently:

For delete operations tied to CDC payloads, TVP merges with flags could be beneficial, but batching DELETE statements can also help depending on your workload. Be wary of potential locking issues if deletes are frequent.

Temporary Tables vs. Staging Tables:

Creating temp tables for quick load may not outperform permanent staging tables in all scenarios. Testing with your expected load is key to seeing what works best in Hyperscale for frequency and size.

Bulk Copy API:

The SQLServerBulkCopy is primarily for insert operations. Mixing with updates/deletes can be complex and isn't a straightforward single command.

Resiliency & Retries:

If a batch fails, breaking it into smaller chunks for retrying often results in better overall recovery time while avoiding long rollbacks.

Future Enhancements:

Keep an eye on Azure updates, as they may roll out features improving direct merge capabilities or JDBC performance enhancements.

For more information, please refer the documents:

https://learn.microsoft.com/en-us/sql/connect/jdbc/performing-batch-operations?view=sql-server-ver17

https://dotnettutorials.net/lesson/bulk-operations-using-t-sql-command-in-ado-net-core/

https://docs.azure.cn/en-us/azure-sql/database/hyperscale-performance-diagnostics

https://learn.microsoft.com/en-us/sql/connect/ado-net/sql/transaction-bulk-copy-operations?view=sql-server-ver17

https://github.com/microsoft/mssql-jdbc

I hope this information helps. Please do let us know if you have any further queries.
Mahesh Kurva 7,570 Reputation points Microsoft External Staff Moderator

2025-08-26T18:10:25.23+00:00

Hi Janice Chi,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.
Mahesh Kurva 7,570 Reputation points Microsoft External Staff Moderator

2025-08-28T18:25:57.73+00:00

Hi Janice Chi,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.

Your answer

Mahesh Kurva 7,570 Reputation points Microsoft External Staff Moderator

2025-08-26T18:10:25.23+00:00

Hi Janice Chi,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.
Mahesh Kurva 7,570 Reputation points Microsoft External Staff Moderator

2025-08-28T18:25:57.73+00:00

Hi Janice Chi,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.

Share via

Optimizing Bulk Merge into Azure SQL Hyperscale Without Staging Tables

Your answer