Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] VM failed to get IP after node reboot #2903

Closed
guangbochen opened this issue Oct 11, 2022 · 2 comments
Closed

[BUG] VM failed to get IP after node reboot #2903

guangbochen opened this issue Oct 11, 2022 · 2 comments
Assignees
Labels
area/network Harvester network related blocker blocker of major functionality kind/bug Issues that are defects reported by users or that we know have reached a real release not-require/test-plan Skip to create a e2e automation test issue priority/0 Must be fixed in this release reproduce/always Reproducible 100% of the time
Milestone

Comments

@guangbochen
Copy link
Contributor

Describe the bug
VM with VLAN network failed to get IP after node reboot

To Reproduce
Steps to reproduce the behavior:

  1. Create a new cluster network and add related network configs
  2. Create networks with associated vlan id
  3. Create a new VM using this VLAN network and after it starts successfully reboot the node
  4. After the node reboot, the VM failed to retrieve the IPv4.
  5. Check the bridge vlan the associated PVID does not exist.

Expected behavior
VM should be able to get IP back after node reboot.

Environment

  • Harvester ISO version: v1.1.0-rc2
  • Underlying Infrastructure (e.g. Baremetal with Dell PowerEdge R630):

Additional context
Add any other context about the problem here.

@guangbochen guangbochen added kind/bug Issues that are defects reported by users or that we know have reached a real release area/network Harvester network related priority/0 Must be fixed in this release blocker blocker of major functionality reproduce/always Reproducible 100% of the time labels Oct 11, 2022
@guangbochen guangbochen added this to the v1.1.0 milestone Oct 11, 2022
@guangbochen guangbochen added the not-require/test-plan Skip to create a e2e automation test issue label Oct 11, 2022
@harvesterhci-io-github-bot
Copy link
Collaborator

Pre Ready-For-Testing Checklist

  • If labeled: require/HEP Has the Harvester Enhancement Proposal PR submitted?
    The HEP PR is at:

  • Where is the reproduce steps/test steps documented?
    The reproduce steps/test steps are at:

  • Is there a workaround for the issue? If so, where is it documented?
    The workaround is at:

  • Have the backend code been merged (harvester, harvester-installer, etc) (including backport-needed/*)?
    The PR is at:

    • Does the PR include the explanation for the fix or the feature?

    • Does the PR include deployment change (YAML/Chart)? If so, where are the PRs for both YAML file and Chart?
      The PR for the YAML change is at:
      The PR for the chart change is at:

  • If labeled: area/ui Has the UI issue filed or ready to be merged?
    The UI issue/PR is at:

  • If labeled: require/doc, require/knowledge-base Has the necessary document PR submitted or merged?
    The documentation/KB PR is at:

  • If NOT labeled: not-require/test-plan Has the e2e test plan been merged? Have QAs agreed on the automation test case? If only test case skeleton w/o implementation, have you created an implementation issue?

    • The automation skeleton PR is at:
    • The automation test case PR is at:
  • If the fix introduces the code for backward compatibility Has a separate issue been filed with the label release/obsolete-compatibility?
    The compatibility issue is filed at:

@irishgordo irishgordo self-assigned this Oct 17, 2022
@irishgordo
Copy link
Contributor

@guangbochen - this looks good - testing on a single node v1.1.0-rc3

I do notice that the pvid is correct on the bridge used for one of the cluster networks:

rancher@v110-rc3-vm-node:~> ip -d link show dev test-net-1-br | grep -ie 'vlan'
    bridge forward_delay 1500 hello_time 200 max_age 2000 ageing_time 30000 stp_state 0 priority 32768 vlan_filtering 1 vlan_protocol 802.1Q bridge_id 8000.0:0:0:0:0:0 designated_root 8000.0:0:0:0:0:0 root_port 0 root_path_cost 0 topology_change 0 topology_change_detected 0 hello_timer    0.00 tcn_timer    0.00 topology_change_timer    0.00 gc_timer   43.52 vlan_default_pvid 1 vlan_stats_enabled 0 vlan_stats_per_port 0 group_fwd_mask 0 group_address 01:80:c2:00:00:00 mcast_snooping 1 mcast_router 1 mcast_query_use_ifaddr 0 mcast_querier 0 mcast_hash_elasticity 16 mcast_hash_max 4096 mcast_last_member_count 2 mcast_startup_query_count 2 mcast_last_member_interval 100 mcast_membership_interval 26000 mcast_querier_interval 25500 mcast_query_interval 12500 mcast_query_response_interval 1000 mcast_startup_query_interval 3124 mcast_stats_enabled 0 mcast_igmp_version 2 mcast_mld_version 1 nf_call_iptables 0 nf_call_ip6tables 0 nf_call_arptables 0 addrgenmode eui64 numtxqueues 1 numrxqueues 1 gso_max_size 65536 gso_max_segs 65535 

As well as upon multiple cluster networks and multiple reboots the IPvv4 address.
Additional test artifacts:
Screenshot from 2022-10-17 15-22-05
Screenshot from 2022-10-17 15-15-27
Screenshot from 2022-10-17 14-59-39

test_2903_test.mp4

I'll go ahead and close this out.
Tested with multiple reboots.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/network Harvester network related blocker blocker of major functionality kind/bug Issues that are defects reported by users or that we know have reached a real release not-require/test-plan Skip to create a e2e automation test issue priority/0 Must be fixed in this release reproduce/always Reproducible 100% of the time
Projects
None yet
Development

No branches or pull requests

4 participants