
Support pip options for requirements #12090

Closed
GLeurquin opened this issue May 19, 2021 · 5 comments
Labels
onboarding Issues that affect a new user's onboarding experience

Comments

@GLeurquin

I would like to install a pip package with the --no-binary option set.

I tried:

python_requirement_library(name='pydantic', requirements=[
  'pydantic==1.6.1 --no-binary=pydantic'
])

But I get

InvalidFieldException: Invalid requirement 'pydantic==1.6.1 --no-binary=pydantic' in the 'requirements' field for the target 3rdparty/python:pydantic: Parse error at "'--no-bin'": Expected stringEnd

Any way to do this with pants?

@Eric-Arellano Eric-Arellano self-assigned this May 19, 2021
@Eric-Arellano
Contributor

Hey @GLeurquin, not yet, but very soon! We're doing a major revamp of third party dependencies as our next big project, including:

  • supporting lockfiles for each tool
  • autogenerating lockfiles for you, rather than that hacky bash script
  • multiple lockfiles possible for your own code

Part of the prework for that project is to implement this issue's feature. FYI: https://docs.google.com/document/d/1bCYb0UQZx9a-9tAagydCN_z3826QRvz_3aVnXKSNTJw/edit is the overall project proposal.

@GLeurquin
Author

Ok, thanks! For now I'm creating a wheel on a custom repository that is built using the correct flags, then using that as a dependency. Fortunately not too many packages require this, but it would indeed be nice to have the option within Pants. Thanks for working on it :)

Eric-Arellano added a commit that referenced this issue Jun 8, 2021
…tion messages (#12180)

We already recommend both these options in our docs: https://www.pantsbuild.org/docs/troubleshooting#debug-tip-enable-stack-traces-and-increase-logging. But it's not enough to expect people to find that docs page, we should give the recommendation in the message too.

`--no-process-execution-local-cleanup` will become particularly important with #12090, where we will no longer be able to put the requirements installed by Pex directly in the argv and users will need to inspect the generated requirements.txt file to see what was installed.

Examples:

> (Use --print-stacktrace for more error details and/or --no-process-execution-local-cleanup to inspect chroots and/or -ldebug for more logs. See https://www.pantsbuild.org/v2.6/docs/troubleshooting for common issues. Consider reaching out for help: https://www.pantsbuild.org/v2.6/docs/getting-help.)

> (Use --print-stacktrace for more error details and/or -ldebug for more logs. See https://www.pantsbuild.org/v2.6/docs/troubleshooting for common issues. Consider reaching out for help: https://www.pantsbuild.org/v2.6/docs/getting-help.)

> (See https://www.pantsbuild.org/v2.6/docs/troubleshooting for common issues. Consider reaching out for help: https://www.pantsbuild.org/v2.6/docs/getting-help.)

[ci skip-rust]
[ci skip-build-wheels]
@Eric-Arellano
Contributor

@wilsonliam is going to work on this 🎉

See https://pip.pypa.io/en/stable/cli/pip_install/#requirements-file-format for how pip supports options, both global options for the whole requirements file and per-requirement options. The instructions below cover only per-requirement options, not global options. @GLeurquin, I'm thus not sure they will actually solve your original issue, but they can be seen as pre-work for that possible followup. And we must have --hash support for the upcoming rework of 3rd-party requirements.

This can be done in two steps. It probably makes sense for them to be separate PRs, e.g. easier to revert if there are issues.

1: use requirements.txt

Right now, we pass all requirements to install as arguments directly to Pex, e.g. pex Django==1.1 -o foo.pex. The Pex CLI doesn't support Pip arguments.

Instead, we need to use a requirements.txt file and something like pex -r requirements.txt -o foo.pex.

Note, we are not going to use the user's literal requirements.txt file, as requirements can be specified "inline" like with python_requirement_library targets, or the new Poetry macro. Instead, we must generate a requirements.txt file, probably named generated_requirements.txt or requirements.generated.txt. Use await Get(Digest, CreateDigest) to do that.

Specifically, update the build_pex rule to no longer do this:

argv.extend(request.requirements)

Instead, create the digest, merge it in the input_digest, and update the argv to do -r generated_requirements.txt.

I don't think we need to add new tests for this - it's an implementation detail. We only need to make sure the Pex related tests still pass.
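To make step 1 concrete, here is a pure-Python sketch (helper and file names are hypothetical, not Pants APIs) of serializing inline requirement strings into the content of the generated requirements file. In the actual build_pex rule, this content would be encoded to bytes, wrapped in a FileContent, and materialized via await Get(Digest, CreateDigest(...)).

```python
# Hypothetical helper sketching step 1: turn the requirement strings that we
# currently pass directly as Pex argv into the content of a generated
# requirements file that Pex consumes via `-r`.

GENERATED_FILE = "generated_requirements.txt"  # file name is an assumption


def generate_requirements_content(requirements: list[str]) -> str:
    """Join one requirement per line, preserving any per-requirement pip args."""
    return "\n".join(requirements) + "\n"


content = generate_requirements_content(
    ["Django==1.1", "pydantic==1.6.1 --no-binary=pydantic"]
)
print(content)
```

Because each requirement becomes its own line, per-requirement options like `--no-binary` survive verbatim, which is exactly what the Pex argv approach could not express.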

2: allow python_requirement_library to have pip args

We currently use pkg_resources.Requirement for the PythonRequirementLibrary target:

class _RequirementSequenceField(Field):
    value: tuple[Requirement, ...]

    @classmethod
    def compute_value(
        cls, raw_value: Optional[Iterable[str]], address: Address
    ) -> Tuple[Requirement, ...]:
        value = super().compute_value(raw_value, address)
        if value is None:
            return ()
        invalid_type_error = InvalidFieldTypeException(
            address,
            cls.alias,
            value,
            expected_type="an iterable of pip-style requirement strings (e.g. a list)",
        )
        if isinstance(value, str) or not isinstance(value, collections.abc.Iterable):
            raise invalid_type_error
        result = []
        for v in value:
            # We allow passing a pre-parsed `Requirement`. This is intended for macros which might
            # have already parsed so that we can avoid parsing multiple times.
            if isinstance(v, Requirement):
                result.append(v)
            elif isinstance(v, str):
                try:
                    parsed = Requirement.parse(v)
                except Exception as e:
                    raise InvalidFieldException(
                        _format_invalid_requirement_string_error(
                            v,
                            e,
                            description_of_origin=(
                                f"the '{cls.alias}' field for the target {address}"
                            ),
                        )
                    )
                result.append(parsed)
            else:
                raise invalid_type_error
        return tuple(result)


class PythonRequirementsField(_RequirementSequenceField):
    alias = "requirements"
    required = True
    help = (
        "A sequence of pip-style requirement strings, e.g. `['foo==1.8', "
        "\"bar<=3 ; python_version<'3'\"]`."
    )

This is really useful because we can do things like get the .project_name for dependency inference:

canonicalize_project_name(req.project_name),

We want to keep using pkg_resources.Requirement, but we also need to capture any additional Pip args. We can use composition to do this (rather than inheritance). Something like:

@dataclass(frozen=True)
class PythonRequirement:
    req: pkg_resources.Requirement
    pip_args: tuple[str, ...] = ()

Or, probably easier to implement: we don't really need to split the pip args into a tuple, because all we want is to preserve them so that we can put them in the generated requirements.txt. We're not inspecting individual elements.

@dataclass(frozen=True)
class PythonRequirement:
    req: pkg_resources.Requirement
    pip_args: str | None = None

You will want to add an @classmethod parse() that takes a string and creates a PythonRequirement. Similar to your Poetry project, test-driven development is particularly helpful for writing this. You'll probably want to use Python's str.split() method and set maxsplit: https://www.w3schools.com/python/ref_string_split.asp

You will also want to implement __str__ to recombine back into a single string. It'd be good to have a unit test for that.
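The parse()/__str__ pair described above might be sketched like this. To keep the example self-contained, the requirement is stored as a plain string rather than a parsed pkg_resources.Requirement, and the split-on-first-option heuristic is an assumption about how parse() could work, not the final implementation:

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class PythonRequirement:
    # Simplified: the real field would be a parsed `pkg_resources.Requirement`.
    req: str
    pip_args: str | None = None

    @classmethod
    def parse(cls, raw: str) -> PythonRequirement:
        # Split the requirement proper from any trailing pip options, e.g.
        # "pydantic==1.6.1 --no-binary=pydantic"
        #   -> req="pydantic==1.6.1", pip_args="--no-binary=pydantic".
        if " --" in raw:
            req, rest = raw.split(" --", maxsplit=1)
            return cls(req=req.strip(), pip_args="--" + rest)
        return cls(req=raw.strip())

    def __str__(self) -> str:
        # Recombine into a single line for the generated requirements.txt.
        return self.req if self.pip_args is None else f"{self.req} {self.pip_args}"
```

A round-trip unit test (str(PythonRequirement.parse(s)) == s) is a natural first test case for the TDD workflow suggested above.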

You'll want to define this in backend/python/target_types.py and then hook it up to PythonRequirementsField. Note that we still want to allow passing either a pre-parsed PythonRequirement (for macros) or a raw string. You'll need to update _RequirementSequenceField, and you can remove that and inline it all into PythonRequirementsField if easier.

To keep things simple, you should be able to keep PexRequirements from pex.py the same for now, that it stores strings rather than PythonRequirements. An advantage of that is that we don't need to update the call sites where we create tool PEXes like for Pylint and Black - they stay the same. We're going to be touching that code a lot to support the overall redesign of 3rdparty requirements, so we can reduce churn for now.

class PexRequirements(DeduplicatedCollection[str]):
    sort_input = True

    @classmethod
    def create_from_requirement_fields(
        cls,
        fields: Iterable[PythonRequirementsField],
        *,
        additional_requirements: Iterable[str] = (),
    ) -> PexRequirements:
        field_requirements = {str(python_req) for field in fields for python_req in field.value}
        return PexRequirements({*field_requirements, *additional_requirements})

We probably want to add an integration test to pex_test.py. I'm thinking test that --hash works correctly. Choose a requirement like ansicolors that doesn't have any dependencies, and do something like create a simple Poetry project with it and run poetry export to get all the hashes. Copy that entire requirement string with all the hashes, and test that you can correctly build the PEX. Then, in the same test, now use the same requirement but mess up each of the hashes, e.g. change their values by a letter; check that that fails. (We need that failing case to prove that hashes are actually being consumed, that we don't just ignore them. We also want the succeeding case to have positive confirmation things work and that the PEX didn't fail to build for an unrelated reason.)
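For reference, a per-requirement --hash entry in a requirements file (the kind poetry export produces and the integration test would consume) looks something like the following; the digest values here are placeholders, not real hashes:

```text
ansicolors==1.1.8 \
    --hash=sha256:<hash-of-sdist-from-poetry-export> \
    --hash=sha256:<hash-of-wheel-from-poetry-export>
```

Corrupting one of these digests by a single character is what the failing half of the test would do, proving the hashes are actually enforced rather than ignored.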

@benjyw
Contributor

benjyw commented Apr 2, 2022

Addressed in #14985

@benjyw
Contributor

benjyw commented Apr 5, 2022

Fixed in #14985

@benjyw benjyw closed this as completed Apr 5, 2022