Films where two specific actors appeared together in the sakila (mysql) db. - Databases

anoldmaninthesea

I'm following a book which states that, to list the films in which both Cate McQueen and Cuba Birch appeared, I should run the following query:

```
select f.release_year, f.title, concat(a1.first_name, " ", a1.last_name)
from film f
    inner join film_actor fa1
        on f.film_id = fa1.film_id
    inner join actor a1
        on fa1.actor_id = a1.actor_id
    inner join film_actor fa2
        on f.film_id = fa2.film_id
    inner join actor a2
        on fa2.actor_id = a2.actor_id
where (a1.first_name = "CATE" and a1.last_name = "mcqueen")
    and (a2.first_name = "cuba" and a2.last_name = "BiRcH");
```


My question is: why do we need the second inner join on `film_actor fa2`? Why can't we just do the following?

```
select f.release_year, f.title, concat(a1.first_name, " ", a1.last_name)
from film f
    inner join film_actor fa1
        on f.film_id = fa1.film_id
    inner join actor a1
        on fa1.actor_id = a1.actor_id
    inner join actor a2
        on fa1.actor_id = a2.actor_id
where (a1.first_name = "CATE" and a1.last_name = "mcqueen")
    and (a2.first_name = "cuba" and a2.last_name = "BiRcH");
```

Edit: I tried my way, but it returned an empty set instead of the 2-row table shown in the book.

Top Answer

Jack Douglas

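The second `film_actor` join is required because each `film_actor` alias ends up restricted to a single actor: the `where` clause pins `fa1` to Cate McQueen's rows and `fa2` to Cuba Birch's rows. The query is meant to get films where the two actors appear together, so it needs one alias per actor.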

It's perhaps slightly clearer what's happening in this transformation of your query:

https://dbfiddle.uk/?rdbms=mysql_8.0&sample=sakila&fiddle=e96c0765b0d337328fe0e570896ee0b5
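
As a rough illustration of why one `film_actor` alias per actor is needed, here is a minimal sketch that runs the book's query shape against a tiny in-memory SQLite database from Python. The three films and their casts are made-up toy data, not the real sakila sample, and SQLite stands in for MySQL. Because of that, the sketch uses single-quoted strings that match the stored case exactly (SQLite compares text case-sensitively by default, unlike MySQL's usual collation), and it selects the two last names instead of calling `concat`.

```python
import sqlite3

# Toy stand-in for the sakila tables (hypothetical data, not the real sample set).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table film (film_id integer primary key, title text);
    create table actor (actor_id integer primary key, first_name text, last_name text);
    create table film_actor (actor_id integer, film_id integer);
    insert into film values (1, 'FILM ONE'), (2, 'FILM TWO'), (3, 'FILM THREE');
    insert into actor values (1, 'CATE', 'MCQUEEN'), (2, 'CUBA', 'BIRCH');
    -- Film 1 has both actors, film 2 only Cate, film 3 only Cuba.
    insert into film_actor values (1, 1), (2, 1), (1, 2), (2, 3);
""")

# Same shape as the book's query: fa1/a1 is pinned to Cate, fa2/a2 to Cuba,
# and both aliases are tied to the same film through f.film_id.
rows = conn.execute("""
    select f.title, a1.last_name, a2.last_name
    from film f
        inner join film_actor fa1 on f.film_id = fa1.film_id
        inner join actor a1 on fa1.actor_id = a1.actor_id
        inner join film_actor fa2 on f.film_id = fa2.film_id
        inner join actor a2 on fa2.actor_id = a2.actor_id
    where a1.first_name = 'CATE' and a1.last_name = 'MCQUEEN'
      and a2.first_name = 'CUBA' and a2.last_name = 'BIRCH'
""").fetchall()

print(rows)  # only the film that has both actors in its cast
```

Only the film whose cast contains both actors survives, because a row is kept only when `fa1` finds Cate and `fa2` finds Cuba for the same `film_id`.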

Logically you can do the same thing without joining `film_actor` twice:

https://dbfiddle.uk/?rdbms=mysql_8.0&sample=sakila&fiddle=d531515ee337dc2c7d3a12e576fe2592
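
One common way to get the same films with a single `film_actor` join is to keep rows for either actor, group by film, and keep the groups that contain both actors. The sketch below assumes that approach; it is not necessarily what the linked fiddle does. It uses the same made-up toy data in an in-memory SQLite database (a stand-in for MySQL and the real sakila sample), with names matched in their stored case.

```python
import sqlite3

# Toy stand-in for the sakila tables (hypothetical data, not the real sample set).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table film (film_id integer primary key, title text);
    create table actor (actor_id integer primary key, first_name text, last_name text);
    create table film_actor (actor_id integer, film_id integer);
    insert into film values (1, 'FILM ONE'), (2, 'FILM TWO'), (3, 'FILM THREE');
    insert into actor values (1, 'CATE', 'MCQUEEN'), (2, 'CUBA', 'BIRCH');
    -- Film 1 has both actors, film 2 only Cate, film 3 only Cuba.
    insert into film_actor values (1, 1), (2, 1), (1, 2), (2, 3);
""")

# Keep cast rows for either actor, then keep only films where both were found.
rows = conn.execute("""
    select f.title
    from film f
        inner join film_actor fa on f.film_id = fa.film_id
        inner join actor a on fa.actor_id = a.actor_id
    where (a.first_name = 'CATE' and a.last_name = 'MCQUEEN')
       or (a.first_name = 'CUBA' and a.last_name = 'BIRCH')
    group by f.film_id, f.title
    having count(distinct a.actor_id) = 2
""").fetchall()

print(rows)
```

Note that after the `group by` there is no longer one row per actor, which is why showing each actor's name next to the film is awkward with this form.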

But don't assume this will be faster: RDBMSs were born to `join`.

No, your version couldn't have worked because it's founded on a misunderstanding of how joins work. The thing people have trouble with is understanding that every join is a `cross join` followed by a filter. If you really get that, you can visualise the joins better and you'll realise that the two filters are working against each other: `a1` and `a2` are both joined on `fa1.actor_id`, so they are always the same actor row, and a single row can't be both Cate McQueen and Cuba Birch.
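
To make the "cross join, then filter" reading concrete, here is a minimal sketch that rewrites the asker's single-`film_actor` query as explicit cross joins with the join conditions moved into `where`. It again uses made-up toy data in an in-memory SQLite database rather than MySQL and the real sakila sample. The variable names `join_only` and `with_names` are just for this illustration.

```python
import sqlite3

# Toy stand-in for the sakila tables (hypothetical data, not the real sample set).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table film (film_id integer primary key, title text);
    create table actor (actor_id integer primary key, first_name text, last_name text);
    create table film_actor (actor_id integer, film_id integer);
    insert into film values (1, 'FILM ONE'), (2, 'FILM TWO'), (3, 'FILM THREE');
    insert into actor values (1, 'CATE', 'MCQUEEN'), (2, 'CUBA', 'BIRCH');
    insert into film_actor values (1, 1), (2, 1), (1, 2), (2, 3);
""")

# The asker's joins, written as cross joins plus filters (no name filters yet).
join_filters = """
    select a1.actor_id, a2.actor_id
    from film f
        cross join film_actor fa1
        cross join actor a1
        cross join actor a2
    where f.film_id = fa1.film_id
      and fa1.actor_id = a1.actor_id
      and fa1.actor_id = a2.actor_id
"""
join_only = conn.execute(join_filters).fetchall()

# Both actor aliases are tied to the same fa1.actor_id, so they always match.
print(all(left == right for left, right in join_only))

# Adding the two name filters asks one actor row to be Cate and Cuba at once.
with_names = conn.execute(join_filters + """
      and a1.first_name = 'CATE' and a1.last_name = 'MCQUEEN'
      and a2.first_name = 'CUBA' and a2.last_name = 'BIRCH'
""").fetchall()
print(with_names)
```

Every surviving row pairs an actor with itself, so the name filters can never both hold and the result is empty, which matches the empty set the asker saw.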

2) On tables vs. results: So, a result is different from a table? Because I'm only learning this now, I got the idea that everything was a table: the results of queries were tables, etc. I'll read more on this then.

Jack, 1) I meant: how can I know in advance that I'll need an extra join on a table to avoid a restriction like that? From the code itself, my version looked like it could have worked (if SQL allowed it); I had to run the command to see whether it did. Maybe what I'm wondering about is the internal workings of SQL. My experience with other programming languages is that I can understand what the code does without even running it (for small examples like this). However, with SQL, it seems I always need to run it to see how it will be interpreted by the 'server'.

> could you add «concat(a2.first_name," ", a2.last_name) as full_name_a2» to the 'super' select?

Which one is the 'super' select? If you mean the second one then the answer is "no, not easily": that's one of the limitations of that approach! You can do it with a "pivot", see [here on SO](https://stackoverflow.com/a/16000899/12757754) for more details, but if you need the actor name(s) you should almost certainly go for the `join` approach instead.

> Is there a way to realise when SQL will assume the tables to be the same (before we run the code)

I'm afraid I don't know what you mean by 'assume'. Also, when you say 'tables' I think you mean 'result'; calling the result a table will be very confusing to people.

1 suggestion: could you add «concat(a2.first_name," ", a2.last_name) as full_name_a2» to the 'super' select? I think it would improve your answer. 1 doubt: Is there a way to realise when SQL will assume the tables to be the same (before we run the code)? 1 compliment: the more I read your answer, the more I like it. =D
