Invantive limits on WHERE IN () clause for performance optimizations

We are building a very specific script. In this script we have the following statement:

select * 
from   transactionlines@sqlserver
where id IN (select id from temp@inmemorystorage)

We don’t want to use a JOIN statement because transactionlines@sqlserver contains more than 3 million rows; a join would trigger a full load of the SQL Server table and then perform the join on Invantive’s side.
temp@inmemorystorage should be rather small, maybe fewer than 20K records.

We found that if temp@inmemorystorage has a small set of records (<150), the above statement works fine.

However, if temp@inmemorystorage contains more results, e.g. 14K rows,
we get a truncated result: it seems that Invantive does not forward all the results of the IN clause to the SQL Server container.

But if we execute the following:

select *
from   transactionlines@sqlserver
where  id IN (select id from temp@inmemorystorage LIMIT 100000)

then Invantive forwards all the results in the IN clause and we get the results as expected.

Is the above approach (adding a LIMIT clause to force forwarding all results to SQL Server) a good one?

The statement should work fine with say 14K rows in temp@inmemorystorage.

A more efficient and better way is probably to switch the join strategy using the join_set hint (see Invantive UniversalSQL Grammar 23.0). join_set is used by default when the left-hand side has few entries (such as 5,000) to be joined upon.

A join_set uses an IN on the respective platform instead of a full join, and is typically used, for instance, when joining a list of open sales orders with its associated lines. It can be hundreds of times faster when there are many sales orders but only 1% are open.

An example:

select /*+ join_set(t1, id, 20000) */ *
from   temp@inmemorystorage t1
join   transactionlines@sqlserver t2
on     t2.id = t1.id

The documentation on Invantive UniversalSQL grammar describes the functionality of join_set.

Some other use cases are available at:

Please check in advance that both id columns have the same data type. Otherwise an implicit data type conversion will occur, triggering a very expensive join.
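As an illustrative sketch only (the assumption that temp.id is numeric while transactionlines.id is a string, and the use of to_char, are hypothetical; check the actual data types in your containers first), an explicit conversion on the small in-memory side avoids the implicit conversion:

```sql
-- Sketch: convert on the small in-memory side, so the 3M-row SQL Server
-- table keeps its native data type and the forwarded IN list stays cheap.
select /*+ join_set(t1, id, 20000) */ *
from   temp@inmemorystorage t1
join   transactionlines@sqlserver t2
on     t2.id = to_char(t1.id)
```

Converting on the large remote side instead would force the conversion on every row of transactionlines@sqlserver and can defeat any index on the id column.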