I have three records like this
[('a1', ['b1', 'c1', 'd1', 'e1']), ('a2', ['b1', 'c2', 'd2', 'e1']),('a3', ['b1', 'c2', 'd1', 'e2'])]
containing the id for key as the list of the values for each key.
- I want to get the total number of values in the list for each key in pyspark.
- How I can get the list of the common friends between any pair of fiends in pyspark.