World: r3wp

Join the discussions in the REBOL3 world...

[!RebDB] REBOL Pseudo-Relational Database

older newer	first last
Pekr 8-Feb-2006 [6]	hmm, actually join has to do the same, so :-)
Ashley 8-Feb-2006 [7]	JOIN differs from SUB-SELECT where you want to aggregate the columns of more than one table, so: select a.col, b.col from a, b cannot be refactored as a sub-select. There are two reasons why I have not implemented JOINs in RebDB [yet]: 1) Dramatic increase in code complexity 2) You can almost always do it more efficiently in REBOL as you know the data structures and desired result set in advance. About the only time this does not work well [in RebDB] is where you have to pull the contents of more than one table across a network to derive a small subset as the result set. So while this SQL would not suffer: select a.col, b.col from a, b this might: select a.key, b.val from a, b where a.key = b.key depending on the size of b.
Pekr 8-Feb-2006 [8]	yes, but what you describe is more than day-by-day example of proper db usage. Even with small projects, when using 3NF notation, you simply store only foreign keys in tables, so those "aggregate" functions are needed too often ...
Anton 8-Feb-2006 [9x2]	So it looks like the problem is Pekr's expectation that the database have certain common functions, often implemented in other databases. Switching from one database to another is hard because they don't all support the same functions. I think RebDB, as a youthful, new, growing database, should not be expected to have everything yet.
Anton 8-Feb-2006 [9x2]	But, Petr, perhaps it would be good if you could show the operation and tables that you are having trouble with, and Ashley can help you find the best way. Maybe the performance will be more than you need, and the expression simpler in rebol.
Pekr 8-Feb-2006 [11]	no, Anton, everything is ok, no problems with columns, or switching .... I will describe you one example ....
Ashley 8-Feb-2006 [12]	It's called a "Pseudo-Relational Database" for a good reason. ;) But in answer to Pekr's previous point; you can work out how to do that aggregation more efficiently than the limited AI of most RDBMS systems. I've worked at DB2 and Oracle shops where it was mandated that all JOINs be performed inline! SQL optimizers are not trivial to write, and the meta-data overhead (indexes, statistics, hints, etc) required to get them to make the "right" choices are often a false economy.
Anton 8-Feb-2006 [13]	So, Petr, you solved the original problem which occurred for you ?
Pekr 8-Feb-2006 [14x4]	ok, here it is: you have tables called 'orders, 'order-items, 'companies .....
	in 'companies, you have 'company-id, 'company-name, 'company-address ....
	then, when you enter new order into 'orders, you simply assign it to company, by using and storing foreign key, so 'company-id ...
	And now you have typical problem - you surely don't want your grid to show 'company-id, but 'company-name or other related data - but those are not present in 'orders database ...
Anton 8-Feb-2006 [18]	Orders Order-Items Companies company-id, company-name, company-address ?
Pekr 8-Feb-2006 [19x3]	that is imo correct design to simply avoid data inconsistency. Imagine if you would put 'company-name into 'oders, and later on company changes it's name a bit - you simply want that info to have stored only once ...
	yes ...
	would you store company-name in orders directly? Surely not (although we are doing it too for some purposes, as getting grid drawn fast)
Ashley 8-Feb-2006 [22]	Data structure makes sense, what's the problematic SQL statement? (in its simplest form)
Pekr 8-Feb-2006 [23x2]	inner join orders, order items, left outer join companies ....
Pekr 8-Feb-2006 [23x2]	so the inner join? (agregate)
Ashley 8-Feb-2006 [25]	Just write the SQL statement as you would in MySQL, etc ... then we can see where the issues are.
Pekr 8-Feb-2006 [26x5]	ok, here's my rebol odbc code for ADS (Advantage Database Server): insert db-port trim/lines { SELECT DISTINCT ko.kodfyzak, fk.nazev, fk.ulice, fk.mesto, fk.psc, fk.kodzeme, zk.land1, upper(fk.kodmeny), upper(e.ico), ko.kodfypri1, fp.nazev, fp.ulice, fp.mesto, fp.psc, fp.kodzeme, zp.land1, upper(fp.kodmeny) FROM zakp_exp e INNER JOIN Kontjkv5 k5 ON k5.zakazka = concat(e.miv, left(e.czak, 6)) LEFT OUTER JOIN kontrakt ko ON ko.ciskontr = k5.ciskontr LEFT OUTER JOIN firma fk ON fk.kodfirmy = ko.kodfyzak AND fk.aktualni = 'A' LEFT OUTER JOIN firma fp ON fp.kodfirmy = ko.kodfypri1 AND fp.aktualni = 'A' LEFT OUTER JOIN zem zk ON fk.kodzeme = zk.kodzeme LEFT OUTER JOIN zem zp ON fp.kodzeme = zp.kodzeme LEFT OUTER JOIN delka_psc psc ON zp.land1 = psc.land1 ORDER BY fk.nazev, fp.nazev }
	so let's forget those aliases, which are nice things to have too :-)
	left outer join seems like kind of subselect)
	while inner join seems to be kind of aggregate ...
	but that is maybe unnecessarily complicated example, we were migrating data to SAP R3 from our old systems .....
Ashley 8-Feb-2006 [31]	An example based on what you are trying to do with Orders, Items and Companies might be better to start with.
Pekr 8-Feb-2006 [32]	that was just an example simple schema, but well, I may try to write short SQL query from that ;-)
Ashley 8-Feb-2006 [33]	Stick to basic JOIN syntax in the form: select b.name, b.address from a, b where a.id = b.id and ... those INNER JOIN and LEFT OUTER JOIN statements are unfamiliar to me and I get a headache just looking at them. ;)
Pekr 8-Feb-2006 [34x4]	SELECT o.order-id, o.date, o.amount, o.total, oi.item-id, io.item-price, oi.goods-name, c.company-name FROM orders o INNER JOIN order-items oi ON oi.order-id = o.order.id LEFT OUTER JOIN companies c ON c.company-id = o.company-id WHERE o.amount > 2000 ORDER BY c.company-name, o.order-id, o-item-id
	not sure I got it syntactically 100% correct, but you got the idea :-)
	so basically select row defines you final format of result block ...
	inner join simply "adds rows" of order-items per order
Sunanda 8-Feb-2006 [38]	Not sure if I'm off topic here.....But you can do pretty much everything in SQL without using JOIN. Though you may need UNION, subselects and WHERE. Which makes for more portable SQL as JOIN syntax often used vendor-specific extensions to the SQL-92 standard.
Pekr 8-Feb-2006 [39x2]	outer join just selects company name ...
Pekr 8-Feb-2006 [39x2]	Yes, that might be possible ... I am not that skilled in SQL yet, so maybe it could be done other way ....
Sunanda 8-Feb-2006 [41]	I learned before JOIN existed.....Means you are more skilled than me :-)
Pekr 8-Feb-2006 [42x2]	maybe it is a pity rebol can't do union and join on more per record base, but per item base ...
Pekr 8-Feb-2006 [42x2]	that join simply creates one big table for you, which is nice, as you simply can traverse in one loop ....
Ashley 8-Feb-2006 [44]	UNION cannot substitute for JOIN. UNION aggregates the rows returned by multiple statements while JOIN [can] aggregate the columns returned by referring to multiple tables. A subtle but important distinction. For example: Table-A Col ==== 1 2 Table-B Col ==== A B select a.col, b.col from a, b 1 A 1 B 2 A 2 B select * from a union select * from b 1 2 A B
Pekr 8-Feb-2006 [45x2]	do not understand the syntax of select from a,b ... actually - never understood it :-) our db allows select, the rest is crosslinked via joins :-)
Pekr 8-Feb-2006 [45x2]	we would need some join/key ... to define key on which to join two blocks to create third one ... or maybe union/key ...
Gabriele 8-Feb-2006 [47]	ashley, my old dbms3.r supports joins, though probably far from doing it efficiently. in case you need the code feel free to use it.
Ashley 8-Feb-2006 [48]	Thanks, I'll take a look at it. (Marco's sql-protocol.r is also a good read).
Ashley 9-Feb-2006 [49]	dbms3.r isn't in the library, where can I grab a copy from?
Graham 9-Feb-2006 [50x2]	rebol.it is a good bet.
Graham 9-Feb-2006 [50x2]	oohh.. looks like a Cobalt Raq.
Ashley 9-Feb-2006 [52]	Back to Pekr's JOIN problem. First, let's reformat the SQL into something more readable: select o.order-id, o.date, o.amount, o.total, oi.item-id, io.item-price, oi.goods-name, c.company-name from orders o , order-items oi , companies c where o.order.id = oi.order-id and o.company-id = c.company-id and o.amount > 2000 order by c.company-name, o.order-id, o-item-id
Graham 9-Feb-2006 [53]	not here either http://www.colellachiara.com/soft/
Ashley 9-Feb-2006 [54x2]	Then break it down into discrete queries and wrap it in some loops: blk: copy [] foreach [company-id order-id date amount total] sql [ select [company-id order-id date amount total] from orders where [amount > 2000] ][ company-name: second sql compose [lookup companies (company-id)] foreach [item-id item-price goods-name] sql compose [select * from order-items where (order-id)] [ insert tail blk reduce [order-id date amount total item-id item-price goods-name company-name] ] ] sort/skip/compare blk 8 [8 1 5]
Ashley 9-Feb-2006 [54x2]	The final [untested] solution is about as efficient as you can get. Adding JOIN support to RebDB so it can break the query down into similar steps is not a simple task. In all but the most trivial of cases you'd be better off coding it yourself. What might be a good idea is to add something that lets you more easily specify the most common JOIN operation - master/detail with optional LOV (List Of Values) lookup(s). Your query is a classic example of this construct and it accounts for a surprisingly large number of queries. The function would accept two queries, a master query (the orders table in your case) and a details query (the order-items table) and an optional block of column/LOV-table pairs (that perform substitutions such as company-name). The skeleton would look like: sql-join [master-query [block!] detail-query [block!] /order /lov [block!] ] [ buffer: copy [] ... buffer ] and would be used as such: sql-join/order/lov [ select [company-id order-id date amount total] from orders where [amount > 2000] ] [ select * from order-items where %ID% ] [8 1 5] [company-id companies] Would this make things a tad easier?
older newer	first last