I'm currently trying to join two sources of data where the key is an interval (say [date-begin/date-end]). For example:
input1:
20091001-20091002    A
20091011-20091104    B
20080111-20091103    C
(...)
input2:
20090902-20091003    D
20081015-20091204    E
20040011-20050101    F
(...)
I'd like to find all the records where the key1 overlaps the key2. Is it possible with hadoop ? Where can I find an example of implementation ?