Skip to Content.

rare-dev - Re: [rare-dev] nix builds on a bf2556 fails to start bffwd

Subject: Rare project developers

List archive


Re: [rare-dev] nix builds on a bf2556 fails to start bffwd


Chronological Thread 
  • From: Alexander Gall <>
  • To: mc36 <>
  • Cc: <>
  • Subject: Re: [rare-dev] nix builds on a bf2556 fails to start bffwd
  • Date: Wed, 21 Dec 2022 09:50:35 +0100

On Wed, 21 Dec 2022 09:39:41 +0100, mc36 <> said:

> cancel! it was an user error, i had read-only filesystem... after changing
> the uuid in fstab it works fine!

Ah, Ok :)

--
Alex

> thanks for the quick reply,
> cs

> core#show ipv4 lsrp 1 neighbor
> iface router name peerif peer ready uptime
> sdn47.158 10.10.10.11 noti sdn1.158 10.1.1.233 true 00:00:10
> sdn47.159 10.10.10.8 mediapc sdn1.159 10.1.1.229 true 00:00:10
> sdn47.160 10.10.10.2 working sdn1.160 10.1.1.225 true 00:00:11
> sdn47.161 10.10.10.5 safe sdn1.161 10.1.1.221 true 00:00:11
> sdn47.162 10.10.10.199 player sdn1.162 10.1.1.217 true 00:00:09
> sdn47.163 10.10.10.20 nas sdn2.163 10.1.1.213 true 00:00:10
> sdn47.164 10.10.10.1 mchome sdn2.164 10.1.1.209 true 00:00:08

> core#show ipv4 bgp 65535 summary
> neighbor as ready learn sent uptime
> 10.5.1.10 65535 true 3972 19 00:00:05
> 10.26.26.2 65535 true 3972 19 00:00:05

> core#

> On 12/21/22 09:19, mc36 wrote:
>> core#reload process bfswd stop
>> core#exit
>>
>> mc36@stordis:~$ cat /etc/freertr/rtr-hw.txt | grep bfswd
>> proc bfswd
>> /nix/store/bnzr1bd06cl4z044287g7ylz1mqbmmls-RARE-scripts/bin/start_bfswd.sh
>> /etc/freertr/p4-profile /var/log
>> prcpar bfswd act col 500
>> dcfg alias exec tna-set-profile cmd2nd reload process bfswd
>> mc36@stordis:~$
>> mc36@stordis:~$ sudo
>> /nix/store/bnzr1bd06cl4z044287g7ylz1mqbmmls-RARE-scripts/bin/start_bfswd.sh
>> /etc/freertr/p4-profile /var/log
>> [sudo] password for mc36:
>> Using bf_router profile "NOP_MCHOME"
>> mc36@stordis:~$
>>
>>
>>
>>
>>
>> On 12/21/22 09:12, Alexander Gall wrote:
>>> Well, there should be a process called salRefApp. Can you please stop
>>> the freerouter systemd service and then execute directly the command
>>> that's bound to the bfswd alias?
>>>
>>> On Wed, 21 Dec 2022 09:08:12 +0100, mc36 <> said:
>>>
>>>> sorry, typo:
>>>> mc36@stordis:~$ ps aux | grep sal
>>>> root 995 0.0 0.0 546932 30276 ? Sl
>>>> 08:52 0:00
>>>> /nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
>>>> /nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
>>>> --no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
>>>> /var/run/rare-snmp --ifindex /etc/snmp/ifindex
>>>> --sal-grpc-server-address=127.0.0.1:50053
>>>> --p4-program-name=bf_router_NOP_MCHOME
>>>> mc36 8811 0.0 0.0 6244 636 pts/0 S+
>>>> 09:07 0:00 grep sal
>>>> mc36@stordis:~$ ps aux | grep sal
>>>> root 995 0.0 0.0 546932 30276 ? Sl
>>>> 08:52 0:00
>>>> /nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
>>>> /nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
>>>> --no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
>>>> /var/run/rare-snmp --ifindex /etc/snmp/ifindex
>>>> --sal-grpc-server-address=127.0.0.1:50053
>>>> --p4-program-name=bf_router_NOP_MCHOME
>>>> mc36 8813 0.0 0.0 6244 644 pts/0 S+
>>>> 09:07 0:00 grep sal
>>>> mc36@stordis:~$ ps aux | grep sal
>>>> root 995 0.0 0.0 546932 30276 ? Sl
>>>> 08:52 0:00
>>>> /nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
>>>> /nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
>>>> --no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
>>>> /var/run/rare-snmp --ifindex /etc/snmp/ifindex
>>>> --sal-grpc-server-address=127.0.0.1:50053
>>>> --p4-program-name=bf_router_NOP_MCHOME
>>>> mc36 8815 0.0 0.0 6244 704 pts/0 S+
>>>> 09:07 0:00 grep sal
>>>> mc36@stordis:~$ ps aux | grep sal
>>>> root 995 0.0 0.0 546932 30276 ? Sl
>>>> 08:52 0:00
>>>> /nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
>>>> /nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
>>>> --no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
>>>> /var/run/rare-snmp --ifindex /etc/snmp/ifindex
>>>> --sal-grpc-server-address=127.0.0.1:50053
>>>> --p4-program-name=bf_router_NOP_MCHOME
>>>> mc36 8817 0.0 0.0 6244 640 pts/0 S+
>>>> 09:07 0:00 grep sal
>>>> mc36@stordis:~$
>>>
>>>
>>>
>>>> On 12/21/22 09:07, mc36 wrote:
>>>>> moreover i cannot spot the process in linux
>>>>>
>>>>>
>>>>> mc36@stordis:~$ ps aux | grep Sal
>>>>> mc36 8436 0.0 0.0 6244
>>>>> 704 pts/0 S+ 09:07 0:00 grep Sal
>>>>> mc36@stordis:~$ ps aux | grep Sal
>>>>> mc36 8453 0.0 0.0 6244
>>>>> 708 pts/0 S+ 09:07 0:00 grep Sal
>>>>> mc36@stordis:~$ ps aux | grep Sal
>>>>> mc36 8455 0.0 0.0 6244
>>>>> 704 pts/0 S+ 09:07 0:00 grep Sal
>>>>> mc36@stordis:~$ ps aux | grep Sal
>>>>> mc36 8457 0.0 0.0 6244
>>>>> 700 pts/0 S+ 09:07 0:00 grep Sal
>>>>> mc36@stordis:~$ ps aux | grep Sal
>>>>> mc36 8459 0.0 0.0 6244
>>>>> 640 pts/0 S+ 09:07 0:00 grep Sal
>>>>> mc36@stordis:~$
>>>>>
>>>>>
>>>>>
>>>>> On 12/21/22 09:06, mc36 wrote:
>>>>>> nothing, the attach is completely empty but that prints out everything
>>>>>> even without a newline....
>>>>>>
>>>>>> On 12/21/22 09:04, Alexander Gall wrote:
>>>>>>> On Wed, 21 Dec 2022 08:53:00 +0100, mc36 <> said:
>>>>>>>
>>>>>>>> hi,
>>>>>>>> here we goo...
>>>>>>>> thanks,
>>>>>>>> cs
>>>>>>>
>>>>>>>> core#tna-list-long-installed
>>>>>>>> Generation Current Release Git Tag
>>>>>>>> KernelID
>>>>>>>> Kernel Release
>>>>>>>> Platform
>>>>>>>> Install date
>>>>>>>> -----------------------------------------------------------------------------------------------------------------------------------------------------------
>>>>>>>> 1 1eta release-1eta
>>>>>>>> Debian11_0
>>>>>>>> 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t 2022-04-16
>>>>>>>> 22:22:46.309693289 +0200
>>>>>>>> 2 1theta
>>>>>>>> release-1eta-16-g29bb914 Debian11_0
>>>>>>>> 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t 2022-04-16
>>>>>>>> 22:22:46.313693289 +0200
>>>>>>>> 3 1theta
>>>>>>>> release-1eta-18-g73197c4 Debian11_0
>>>>>>>> 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t 2022-04-16
>>>>>>>> 22:22:46.309693289 +0200
>>>>>>>> 4 1theta
>>>>>>>> release-1eta-33-ge6051d4 Debian11_0
>>>>>>>> 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t 2022-04-16
>>>>>>>> 22:22:46.313693289 +0200
>>>>>>>> 5 1
>>>>>>>> release-1
>>>>>>>> Debian11_0 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t
>>>>>>>> 2022-12-20 21:31:14.099817139 +0100
>>>>>>>> 6 * 2
>>>>>>>> release-1-111-gd936209-freertr-a66e05 Debian11_0
>>>>>>>> 5.10.0-8-amd64
>>>>>>>> stordis_bf2556x_1t 2022-12-20
>>>>>>>> 22:18:33.437665762 +0100
>>>>>>>
>>>>>>>
>>>>>>>> core#
>>>>>>>> core#show logging process bfswd
>>>>>>>> Using bf_router profile "NOP_MCHOME"
>>>>>>>
>>>>>>> That's only the output of the wrapper script that starts the actual
>>>>>>> SAL. There must be more output. Maybe right after you restart the
>>>>>>> process? Otherwise you can start it by hand just to see what
>>>>>>> happens.
>>>>>>>
>>>>>>> --
>>>>>>> Alex
>>>>>>>
>>>>>>>> core#
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>> On 12/21/22 08:44, Alexander Gall wrote:
>>>>>>>>> Hi
>>>>>>>>>
>>>>>>>>> On the 2556 the bfswd process is actually the SAL which waits for a
>>>>>>>>> connection from bffwd. The actual bf_switchd is only started after
>>>>>>>>> bffwd has connected sucessfully and asked the SAL to start
>>>>>>>>> bf_switchd
>>>>>>>>> so this can't have happened yet if the connection fails. Can
>>>>>>>>> you show
>>>>>>>>> me the output of the bfswd process please?
>>>>>>>>>
>>>>>>>>> Also, which version are you using (tna-list-long-installed)?
>>>>>>>>>
>>>>>>>>> --
>>>>>>>>> Alex
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Wed, 21 Dec 2022 08:09:27 +0100, mc36 <> said:
>>>>>>>>>
>>>>>>>>>> hi,
>>>>>>>>>> i've got my bf2556 back... i upgraded to experimental... the bfswd
>>>>>>>>>> process seems stable but the bffwd keeps restarting:
>>>>>>>>>
>>>>>>>>>> core#attach process bffwd
>>>>>>>>>
>>>>>>>>>> SalGrpcClient.TestConnection: Failed to connectd to SAL server at
>>>>>>>>>> 127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
>>>>>>>>>> status = StatusCode.UNAVAILABLE
>>>>>>>>>> details = "channel is in state TRANSIENT_FAILURE"
>>>>>>>>>> debug_error_string =
>>>>>>>>>> "{"created","description":"channel is in
>>>>>>>>>> state
>>>>>>>>>> TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
>>>>>>>>>>> , retrying
>>>>>>>>>> info cfgPrcss.doRound:cfgPrcss.java:509 restarting process bfswd
>>>>>>>>>> SalGrpcClient.TestConnection: Failed to connectd to SAL server at
>>>>>>>>>> 127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
>>>>>>>>>> status = StatusCode.UNAVAILABLE
>>>>>>>>>> details = "channel is in state TRANSIENT_FAILURE"
>>>>>>>>>> debug_error_string =
>>>>>>>>>> "{"created","description":"channel is in
>>>>>>>>>> state
>>>>>>>>>> TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
>>>>>>>>>>> , retrying
>>>>>>>>>> info cfgPrcss.doRound:cfgPrcss.java:509 restarting process bfswd
>>>>>>>>>> SalGrpcClient.TestConnection: Failed to connectd to SAL server at
>>>>>>>>>> 127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
>>>>>>>>>> status = StatusCode.UNAVAILABLE
>>>>>>>>>> details = "channel is in state TRANSIENT_FAILURE"
>>>>>>>>>> debug_error_string =
>>>>>>>>>> "{"created","description":"channel is in
>>>>>>>>>> state
>>>>>>>>>> TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
>>>>>>>>>>> , retrying
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> this is all i see... any idea on whats going on?
>>>>>>>>>> thanks,
>>>>>>>>>> cs



Archive powered by MHonArc 2.6.19.

Top of Page