Skip to Content.

rare-dev - Re: [rare-dev] nix builds on a bf2556 fails to start bffwd

Subject: Rare project developers

List archive


Re: [rare-dev] nix builds on a bf2556 fails to start bffwd


Chronological Thread 
  • From: mc36 <>
  • To: Alexander Gall <>
  • Cc:
  • Subject: Re: [rare-dev] nix builds on a bf2556 fails to start bffwd
  • Date: Wed, 21 Dec 2022 09:19:31 +0100

core#reload process bfswd stop
core#exit

mc36@stordis:~$ cat /etc/freertr/rtr-hw.txt | grep bfswd
proc bfswd
/nix/store/bnzr1bd06cl4z044287g7ylz1mqbmmls-RARE-scripts/bin/start_bfswd.sh
/etc/freertr/p4-profile /var/log
prcpar bfswd act col 500
dcfg alias exec tna-set-profile cmd2nd reload process bfswd
mc36@stordis:~$
mc36@stordis:~$ sudo
/nix/store/bnzr1bd06cl4z044287g7ylz1mqbmmls-RARE-scripts/bin/start_bfswd.sh
/etc/freertr/p4-profile /var/log
[sudo] password for mc36:
Using bf_router profile "NOP_MCHOME"
mc36@stordis:~$





On 12/21/22 09:12, Alexander Gall wrote:
Well, there should be a process called salRefApp. Can you please stop
the freerouter systemd service and then execute directly the command
that's bound to the bfswd alias?

On Wed, 21 Dec 2022 09:08:12 +0100, mc36 <> said:

sorry, typo:
mc36@stordis:~$ ps aux | grep sal
root 995 0.0 0.0 546932 30276 ? Sl 08:52 0:00
/nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
/nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
--no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
/var/run/rare-snmp --ifindex /etc/snmp/ifindex
--sal-grpc-server-address=127.0.0.1:50053
--p4-program-name=bf_router_NOP_MCHOME
mc36 8811 0.0 0.0 6244 636 pts/0 S+ 09:07 0:00 grep sal
mc36@stordis:~$ ps aux | grep sal
root 995 0.0 0.0 546932 30276 ? Sl 08:52 0:00
/nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
/nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
--no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
/var/run/rare-snmp --ifindex /etc/snmp/ifindex
--sal-grpc-server-address=127.0.0.1:50053
--p4-program-name=bf_router_NOP_MCHOME
mc36 8813 0.0 0.0 6244 644 pts/0 S+ 09:07 0:00 grep sal
mc36@stordis:~$ ps aux | grep sal
root 995 0.0 0.0 546932 30276 ? Sl 08:52 0:00
/nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
/nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
--no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
/var/run/rare-snmp --ifindex /etc/snmp/ifindex
--sal-grpc-server-address=127.0.0.1:50053
--p4-program-name=bf_router_NOP_MCHOME
mc36 8815 0.0 0.0 6244 704 pts/0 S+ 09:07 0:00 grep sal
mc36@stordis:~$ ps aux | grep sal
root 995 0.0 0.0 546932 30276 ? Sl 08:52 0:00
/nix/store/xpwwghl72bb7f48m51amvqiv1l25pa01-python3-3.9.13/bin/python3.9
/nix/store/4gg9b1wjrzmyc5bbml1ypwkai80am915-bf_forwarder-2022.12.16/bin/.bf_forwarder.py-wrapped
--no-log-keepalive --platform=stordis_bf2556x_1t --snmp --ifmibs-dir
/var/run/rare-snmp --ifindex /etc/snmp/ifindex
--sal-grpc-server-address=127.0.0.1:50053
--p4-program-name=bf_router_NOP_MCHOME
mc36 8817 0.0 0.0 6244 640 pts/0 S+ 09:07 0:00 grep sal
mc36@stordis:~$



On 12/21/22 09:07, mc36 wrote:
moreover i cannot spot the process in linux


mc36@stordis:~$ ps aux | grep Sal
mc36 8436 0.0 0.0 6244 704 pts/0 S+ 09:07
0:00 grep Sal
mc36@stordis:~$ ps aux | grep Sal
mc36 8453 0.0 0.0 6244 708 pts/0 S+ 09:07
0:00 grep Sal
mc36@stordis:~$ ps aux | grep Sal
mc36 8455 0.0 0.0 6244 704 pts/0 S+ 09:07
0:00 grep Sal
mc36@stordis:~$ ps aux | grep Sal
mc36 8457 0.0 0.0 6244 700 pts/0 S+ 09:07
0:00 grep Sal
mc36@stordis:~$ ps aux | grep Sal
mc36 8459 0.0 0.0 6244 640 pts/0 S+ 09:07
0:00 grep Sal
mc36@stordis:~$



On 12/21/22 09:06, mc36 wrote:
nothing, the attach is completely empty but that prints out everything even
without a newline....

On 12/21/22 09:04, Alexander Gall wrote:
On Wed, 21 Dec 2022 08:53:00 +0100, mc36 <> said:

hi,
here we goo...
thanks,
cs

core#tna-list-long-installed
Generation Current Release Git Tag
KernelID Kernel Release Platform
Install date
-----------------------------------------------------------------------------------------------------------------------------------------------------------
1 1eta release-1eta
Debian11_0 5.10.0-8-amd64 stordis_bf2556x_1t
2022-04-16 22:22:46.309693289 +0200
2 1theta release-1eta-16-g29bb914 Debian11_0
5.10.0-8-amd64 stordis_bf2556x_1t
2022-04-16 22:22:46.313693289 +0200
3 1theta release-1eta-18-g73197c4 Debian11_0
5.10.0-8-amd64 stordis_bf2556x_1t
2022-04-16 22:22:46.309693289 +0200
4 1theta release-1eta-33-ge6051d4 Debian11_0
5.10.0-8-amd64 stordis_bf2556x_1t
2022-04-16 22:22:46.313693289 +0200
5 1 release-1
Debian11_0 5.10.0-8-amd64 stordis_bf2556x_1t
2022-12-20 21:31:14.099817139 +0100
6 * 2 release-1-111-gd936209-freertr-a66e05
Debian11_0 5.10.0-8-amd64 stordis_bf2556x_1t
2022-12-20 22:18:33.437665762 +0100


core#
core#show logging process bfswd
Using bf_router profile "NOP_MCHOME"

That's only the output of the wrapper script that starts the actual
SAL. There must be more output. Maybe right after you restart the
process? Otherwise you can start it by hand just to see what happens.

--
Alex

core#



On 12/21/22 08:44, Alexander Gall wrote:
Hi

On the 2556 the bfswd process is actually the SAL which waits for a
connection from bffwd. The actual bf_switchd is only started after
bffwd has connected sucessfully and asked the SAL to start bf_switchd
so this can't have happened yet if the connection fails. Can you show
me the output of the bfswd process please?

Also, which version are you using (tna-list-long-installed)?

--
Alex


On Wed, 21 Dec 2022 08:09:27 +0100, mc36 <> said:

hi,
i've got my bf2556 back... i upgraded to experimental... the bfswd process
seems stable but the bffwd keeps restarting:

core#attach process bffwd

SalGrpcClient.TestConnection: Failed to connectd to SAL server at
127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
status = StatusCode.UNAVAILABLE
details = "channel is in state TRANSIENT_FAILURE"
debug_error_string =
"{"created","description":"channel is in
state
TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
, retrying
info cfgPrcss.doRound:cfgPrcss.java:509 restarting process bfswd
SalGrpcClient.TestConnection: Failed to connectd to SAL server at
127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
status = StatusCode.UNAVAILABLE
details = "channel is in state TRANSIENT_FAILURE"
debug_error_string =
"{"created","description":"channel is in
state
TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
, retrying
info cfgPrcss.doRound:cfgPrcss.java:509 restarting process bfswd
SalGrpcClient.TestConnection: Failed to connectd to SAL server at
127.0.0.1:50053: <_Rendezvous of RPC that terminated with:
status = StatusCode.UNAVAILABLE
details = "channel is in state TRANSIENT_FAILURE"
debug_error_string =
"{"created","description":"channel is in
state
TRANSIENT_FAILURE","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":2917,"grpc_status":14}"
, retrying


this is all i see... any idea on whats going on?
thanks,
cs



Archive powered by MHonArc 2.6.19.

Top of Page