Dplyr: Duplicated columns in join get suffixes

Created on 5 Dec 2013 · 9 comments · Source: tidyverse/dplyr

e <- data.frame(x = c(1, 1, 2, 3), z = 1:4)
f <- data.frame(x = c(1, 2, 2, 4), z = 1:4)

j <- inner_join(tbl_cpp(e), tbl_cpp(f), "x")

If the same name exists in both the x and y sources, the variable names in the output get .x and .y appended.
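As a rough illustration of the behaviour described above, here is a minimal sketch that assumes a current dplyr release, where plain data frames are passed to the join directly instead of being wrapped in tbl_cpp(). The data are the issue's own; everything else is an assumption for illustration.

```r
library(dplyr)

# Same data as in the issue: both tables carry a non-key column named z.
e <- data.frame(x = c(1, 1, 2, 3), z = 1:4)
f <- data.frame(x = c(1, 2, 2, 4), z = 1:4)

# Join on x only. Recent dplyr versions may warn about a many-to-many
# relationship here, because x = 1 repeats in e and x = 2 repeats in f.
j <- inner_join(e, f, by = "x")

# The clashing column z appears twice, disambiguated by suffix.
names(j)
```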

bug

hadley

Most helpful comment

I find the default suffixes .x .y rarely useful. Is it sensible to provide an argument "suffixes" for all joins (defaulting to c(".x", ".y")) a la merge()? Or is there an easy way to do that with available tools?

rmatev on 22 Jan 2015

👍3

All 9 comments

BTW this is the last error I get from replacing tbl_df with tbl_cpp - I'll merge in that big change once this one is fixed.

hadley on 5 Dec 2013

Alright. I'll start on that right now then.

romainfrancois on 5 Dec 2013

+1 to rmatev's comment.

The missing option to change the suffixes in the joins sometimes makes me reluctantly use merge().

A fix would be great. Thanks for all your work.

napeednus on 11 Feb 2015

👍1

I agree. Would be nice to pick the suffix.

rickyars on 22 Apr 2015

👍1

One more who ended up here hoping for a suffix option. Or a function like rename_each to pipe in before the join.

sfirke on 28 Apr 2015
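The "rename before the join" idea can be sketched with dplyr's rename(), assuming a current dplyr release. The column names z_left and z_right are hypothetical, chosen only for this example.

```r
library(dplyr)

e <- data.frame(x = c(1, 1, 2, 3), z = 1:4)
f <- data.frame(x = c(1, 2, 2, 4), z = 1:4)

# Rename the clashing column on each side first (new_name = old_name),
# so the join has no duplicated names left to suffix.
# Recent dplyr versions may warn about a many-to-many relationship here.
j <- inner_join(
  rename(e, z_left = z),
  rename(f, z_right = z),
  by = "x"
)

names(j)
```

This keeps the naming explicit, at the cost of one rename() call per table.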

👍1

+1 to rmatev's comment: having customizable suffixes, as in base::merge(), would help a lot.

holgerbrandl on 7 Sep 2015

nigelhenry on 20 Apr 2016

Upvote for the suffix option.

pmBarlev on 23 Jun 2016
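For readers arriving with the same request: current dplyr releases accept a suffix argument on the join functions, defaulting to c(".x", ".y"). A minimal sketch, assuming such a release is installed; the suffixes "_e" and "_f" are illustrative choices, not anything proposed in the thread.

```r
library(dplyr)

e <- data.frame(x = c(1, 1, 2, 3), z = 1:4)
f <- data.frame(x = c(1, 2, 2, 4), z = 1:4)

# suffix takes a character vector of length 2: the first element is
# appended to clashing columns from the left table, the second to those
# from the right table.
# Recent dplyr versions may warn about a many-to-many relationship here.
j <- inner_join(e, f, by = "x", suffix = c("_e", "_f"))

names(j)
```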
