RS3D and RS4D randomly stop reporting on StationView

Two shakes, a RSD3 running .20, and a RS4D running .21, randomly stop reporting data on StationView. They do, however, continue to produce data locally, and both show active server connections. The co-located Boom running .21 does not exhibit this behavior.

All three machines have active NTP sync, albeit to differing pools of servers.

I note the machines are uploading via a Starlink connection that is subject to random interruptions, but I’d expect that the links would recover, particularly since the internet and connection statuses are showing as good.

Any hints on where to start attacking this problem would be appreciated. Nothing glaringly obvious jumps out of the logs.

Hello wiley42,

Could you post the logs from all three instruments when you can? Having one that doesn’t show these transmission issues could give us a clue for the other two.

Thank you!

Sure thing! Should be attached, upload gods willing.

A few things:

  • The 3D that fell off StationView magically showed up again. This is the same 3D that’s steadfastly refusing to update to .21 that’s reported in another ticket, so I won’t upload the same logs over on that ticket.
  • The 4D .21 splash page indicated that services were down. I’ve attached logs taken at that time, and then again after reboot.
  • The Boom log is for the instrument that is always happy with life :wink:

RSH.R3A9B.2026-01-28T01_28_43.logs.tar (4.7 MB)
RSH.R4EFC.2026-01-28T01_27_31.logs.tar (4.8 MB)
RSH.R4EFC.2026-01-28T01_29_57.logs.tar (4.5 MB)
RSH.R5FE0.2026-01-28T01_29_07.logs.tar (4.0 MB)

Thank you for all the logs wiley42, they were very informative!

Indeed, the RBOOM is just coasting through its great life; the others should take an example!

Going into what I’ve found:

  • all Shakes display (every now and then) a disconnection from the NTP time synchronization server. This doesn’t usually last long, and the units always reconnect to the server, so it could be related to the Starlink connection (with all three Shakes together)
  • other than this, the RBOOM is fine
  • however, both the RS3D and the RS4D show some file corruption from the logs. And for the 3D, this is likely the reason why the Shake doesn’t manage to update on its own

Thus, for both the 3D and the 4D, I recommend re-burning the microSD card. You can find instructions on how to do so here.

If the microSD cards continue to show inconsistent behavior, then it may be worth using different ones.

Once the reburning is done, simply reinsert the cards into the Shakes, turn them on, and then everything should be fine now.

In any case, I am always available.

Thank you for the prompt analysis and reply!

I have local stratum-one NTP servers, so I’ll just whack all three of them to point to one of those rather than whatever pool they’re pointing to by default. I’ll need to pull the 3D and 4D out of their vault and I’m traveling most of next week, so I’l update my observations sometime the week of the 8th. Hopefully a couple of new mlc cards and refreshed images will make them as happy as the RBOOM :slight_smile:

Thanks again for the support!

1 Like

So, finally got around to reflashing the R3D and 4D.

Good news, they properly updated themselves to 0.21.1, connected to the servers, and think they’re contributing data.

Bad news, they still end up eventually dropping off of Station View.

They, and the RBoom, are all showing healthy NTP peering.

1 Like

Replying to my own message is bad form but, well, here I go.

Turns out the 4D eventually decided to report that local services were down. Logs are attached (hopefully).
RSH.R4EFC.2026-02-15T18_56_21.logs.tar (2.9 MB)

Hello wiley42,

Yes, the logs were correctly attached, thank you.

This (Monday) UTC morning I’m seeing all three units not transmitting live data, so I don’t know if you have temporarily pulled them or if you’re having an outage.

However, while I can confirm (as you have seen) that the 4D has been correctly updated to v0.21.1, the logs still show continued NTP errors such as the one below:

2026 046 10:38:34>>	Time adjustment M0: HARD RESET.  This will result in a one-time time-tear.

The average is approximately once every 1-2 minutes, so the NTP connection (or the overall internet connection) behaves quite erratically, and the result is stations dropping out of StationView.

If possible, it could be interesting to see what happens using a standard internet connection (non-Starlink one) to see if we can isolate the possible cause.

Also, if you can attach the logs from the other two instruments you have, I can confirm if this is the reason why all three are not reporting, or if there are other factors at play.

I’m increasingly skeptical that loss of NTP sync explains the issue, given that all three units are on the same switch, with the same backhaul, to the same switch, to the same Starlink terminal. It also doesn’t explain why the 4D’s local status page occasionally reports that services have died.

If I had an alternative, I wouldn’t be using Starlink :wink:

Now, on the NTP front, I did observe that the Rboom was configured to use a different pool than the 3D and 4D. However, to factor this mess out entirely, all three are now talking to a pair of stratum one servers installed on the same switch as the Shakes; all three have sync with the local stratum-one servers as well as various pool servers. Currently the 3D and 4D are reporting data locally and claim connections to the server, but are not reporting on Station View; the Boom is showing a server connection and is reporting on Station View, but is showing nothing in the local activity preview…

Logs attached.
RSH.R5FE0.2026-02-16T20_50_11.logs.tar (2.0 MB)
RSH.R4EFC.2026-02-16T20_50_25.logs.tar (4.2 MB)
RSH.R3A9B.2026-02-16T20_50_40.logs.tar (7.0 MB)

Thank you for the additional logs, wiley42.

I have been monitoring your Shakes for the past 24 hours or so, trying to gather more information about your situation:

  • The Boom continued without issues, as far as I could see
  • The 3D appears to be behind real-time, but is slowly catching up
  • The 4D, instead, has transmitted for a bit, and then seemingly stopped

So, overall, at the logs snapshot, the Shakes were connected to the server and could communicate with it. I have asked our server team to check things on our side, and I’ll let you know if they find anything.

In the meantime, could I ask you to SSH into the Shake and execute these commands without rebooting the instruments or your Starlink router?

ping -c 5 raspberryshakedata.com
traceroute raspberryshakedata.com
nc -zvw5 raspberryshakedata.com 55555
nc -zvw5 raspberryshakedata.com 55556

Thank you so much for your collaboration!

4D:
myshake@raspberryshake:/opt $ ping -c 5 raspberryshakedata.com
PING raspberryshakedata.com (65.21.129.27) 56(84) bytes of data.
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=1 ttl=41 time=188 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=2 ttl=41 time=187 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=3 ttl=41 time=189 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=4 ttl=41 time=190 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=5 ttl=41 time=192 ms

raspberryshakedata.com ping statistics —
5 packets transmitted, 5 received, 0% packet loss, time 139ms
rtt min/avg/max/mdev = 186.624/189.047/191.806/1.686 ms
myshake@raspberryshake:/opt $ traceroute raspberryshakedata.com
traceroute to raspberryshakedata.com (65.21.129.27), 30 hops max, 60 byte packets
1 router.mainecoon.com (192.168.88.1) 4.066 ms 4.118 ms 4.248 ms
2 10.10.1.1 (10.10.1.1) 26.794 ms 27.046 ms 26.509 ms
3 157-131-42-161.dedicated.static.sonic.net (157.131.42.161) 34.617 ms 34.893 ms 34.852 ms
4 ae5.cr3.snrtcamn.sonic.net (157.131.242.202) 35.021 ms 27.009 ms 27.247 ms
5 ae2.cr4.snrtcamn.sonic.net (192.184.185.202) 34.687 ms 34.642 ms 34.603 ms
6 ae2.cr2.ptlmca01.sonic.net (192.184.185.177) 38.305 ms 30.809 ms 30.484 ms
7 * * *
8 * * *
9 * * *
10 * * *
11 * * *
12 * * *
13 100.ae1.nrd1.equinix-sj.sonic.net (75.101.33.185) 25.008 ms 34.197 ms 30.176 ms
14 sjo-b23-link.ip.twelve99.net (62.115.182.54) 46.511 ms 38.123 ms 46.428 ms
15 palo-bb2-link.ip.twelve99.net (62.115.139.116) 46.982 ms 46.938 ms 46.896 ms
16 den-bb2-link.ip.twelve99.net (62.115.139.113) 70.198 ms 70.419 ms 70.379 ms
17 kanc-bb2-link.ip.twelve99.net (62.115.137.115) 198.395 ms 198.109 ms kanc-bb2-link.ip.twelve99.net (62.115.140.184) 201.904 ms
18 chi-bb2-link.ip.twelve99.net (62.115.136.102) 198.265 ms 198.787 ms 184.057 ms
19 ewr-bb2-link.ip.twelve99.net (62.115.132.134) 99.932 ms 100.034 ms *
20 kbn-bb6-link.ip.twelve99.net (80.91.254.90) 186.634 ms 183.232 ms 191.088 ms
21 sto-bb2-link.ip.twelve99.net (62.115.139.172) 183.118 ms 191.393 ms hbg-bb2-link.ip.twelve99.net (62.115.122.160) 203.032 ms
22 kbn-bb6-link.ip.twelve99.net (62.115.135.226) 191.304 ms sto-b9-link.ip.twelve99.net (62.115.139.187) 199.534 ms hls-b4-link.ip.twelve99.net (62.115.123.203) 199.221 ms
23 sto-bb2-link.ip.twelve99.net (62.115.139.172) 194.998 ms hls-b6-link.ip.twelve99.net (62.115.137.134) 198.723 ms 198.819 ms
24 hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 199.052 ms 202.696 ms 199.239 ms
25 core32.hel1.hetzner.com (213.239.224.26) 194.965 ms core31.hel1.hetzner.com (213.239.224.38) 198.720 ms hls-b6-link.ip.twelve99.net (62.115.137.134) 191.193 ms
26 hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 195.099 ms ex9k2.dc3.hel1.hetzner.com (213.239.245.114) 179.001 ms 198.320 ms
27 * * *
28 * ex9k2.dc3.hel1.hetzner.com (213.239.245.114) 182.349 ms *
29 * * *
30 * * *
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55555
Connection to raspberryshakedata.com 55555 port [tcp/] succeeded!
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55556
Connection to raspberryshakedata.com 55556 port [tcp/
] succeeded!

For good measure:
myshake@raspberryshake:/opt $ ntpq
ntpq> peers
remote refid st t when poll reach delay offset jitter

*tick.mainecoon. .PPS. 1 u 39 1024 377 0.402 -2.973 0.401
+tock.mainecoon. .PPS. 1 u 857 1024 377 0.394 -2.670 0.476
+96-19-94-82.cpe .GPS3. 1 u 682 1024 377 95.369 -1.190 2.013
-clock.nyc.he.ne 66.220.9.122 3 u 996 1024 377 92.927 0.657 1.836
-69-10-208-170.r 128.9.176.30 2 u 812 1024 377 46.515 -4.380 6.300

3D:
myshake@raspberryshake:/opt $ ping -c 5 raspberryshakedata.com
PING raspberryshakedata.com (65.21.129.27) 56(84) bytes of data.
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=1 ttl=41 time=184 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=2 ttl=41 time=186 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=3 ttl=41 time=190 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=4 ttl=41 time=185 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=5 ttl=41 time=190 ms

raspberryshakedata.com ping statistics —
5 packets transmitted, 5 received, 0% packet loss, time 10ms
rtt min/avg/max/mdev = 183.958/186.987/190.361/2.627 ms
myshake@raspberryshake:/opt $ traceroute raspberryshakedata.com
traceroute to raspberryshakedata.com (65.21.129.27), 30 hops max, 60 byte packets
1 router.mainecoon.com (192.168.88.1) 4.019 ms 4.136 ms 4.085 ms
2 10.10.1.1 (10.10.1.1) 46.851 ms 38.753 ms 46.768 ms
3 * * *
4 ae5.cr3.snrtcamn.sonic.net (157.131.242.202) 47.227 ms 47.187 ms 47.149 ms
5 ae2.cr4.snrtcamn.sonic.net (192.184.185.202) 46.852 ms 46.810 ms 46.772 ms
6 ae2.cr2.ptlmca01.sonic.net (192.184.185.177) 46.727 ms 37.817 ms 37.566 ms
7 ae6.cr4.snrfca01.sonic.net (157.131.243.222) 4996.916 ms * *
8 * * *
9 * * *
10 * * *
11 * * *
12 * * *
13 100.ae1.nrd1.equinix-sj.sonic.net (75.101.33.185) 43.909 ms 43.943 ms 43.904 ms
14 sjo-b23-link.ip.twelve99.net (62.115.182.54) 43.784 ms 43.744 ms 43.705 ms
15 palo-bb2-link.ip.twelve99.net (62.115.139.116) 51.258 ms 43.706 ms 53.421 ms
16 den-bb2-link.ip.twelve99.net (62.115.139.113) 74.950 ms 75.383 ms 59.039 ms
17 kanc-bb2-link.ip.twelve99.net (62.115.140.184) 198.543 ms kanc-bb2-link.ip.twelve99.net (62.115.137.115) 191.757 ms 203.738 ms
18 chi-bb2-link.ip.twelve99.net (62.115.136.102) 203.707 ms * 203.629 ms
19 * * *
20 kbn-bb6-link.ip.twelve99.net (80.91.254.90) 184.244 ms 181.437 ms 179.996 ms
21 sto-bb2-link.ip.twelve99.net (62.115.139.172) 188.420 ms 184.144 ms 189.230 ms
22 hls-b4-link.ip.twelve99.net (62.115.123.203) 188.718 ms 195.712 ms kbn-bb6-link.ip.twelve99.net (62.115.135.226) 181.120 ms
23 hls-b6-link.ip.twelve99.net (62.115.137.134) 183.800 ms 190.582 ms 190.715 ms
24 hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 184.113 ms sto-b9-link.ip.twelve99.net (62.115.139.187) 191.754 ms hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 196.092 ms
25 hls-b6-link.ip.twelve99.net (62.115.137.134) 194.824 ms core31.hel1.hetzner.com (213.239.224.38) 194.593 ms hls-b6-link.ip.twelve99.net (62.115.137.134) 194.391 ms
26 ex9k2.dc3.hel1.hetzner.com (213.239.245.118) 187.555 ms ex9k2.dc3.hel1.hetzner.com (213.239.245.114) 187.146 ms hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 195.800 ms
27 * * hls-b6-link.ip.twelve99.net (62.115.137.134) 186.093 ms
28 * * *
29 * * *
30 * * *
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55555
Connection to raspberryshakedata.com 55555 port [tcp/] succeeded!
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55556
Connection to raspberryshakedata.com 55556 port [tcp/
] succeeded!
myshake@raspberryshake:/opt $ ntpq
ntpq> peers
remote refid st t when poll reach delay offset jitter

SHM(0) .GPS. 0 l - 16 0 0.000 0.000 0.000
SHM(1) .PPS. 0 l - 16 0 0.000 0.000 0.000
*tick.mainecoon. .PPS. 1 u 785 1024 377 0.271 -0.197 0.148
+tock.mainecoon. .PPS. 1 u 686 1024 377 0.276 -0.053 0.039
-time.cloudflare 10.207.8.4 3 u 781 1024 377 37.882 -3.426 5.208
-172-233-155-39. 40.119.6.228 4 u 686 1024 377 39.881 -1.805 2.389
+66.42.71.197 (6 132.239.1.6 2 u 920 1024 377 48.034 2.305 2.682

Boom:
myshake@raspberryshake:/opt $ ping -c 5 raspberryshakedata.com
PING raspberryshakedata.com (65.21.129.27) 56(84) bytes of data.
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=1 ttl=41 time=191 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=2 ttl=41 time=186 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=3 ttl=41 time=200 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=4 ttl=41 time=203 ms
64 bytes from static.27.129.21.65.clients.your-server.de (65.21.129.27): icmp_seq=5 ttl=41 time=190 ms

raspberryshakedata.com ping statistics —
5 packets transmitted, 5 received, 0% packet loss, time 9ms
rtt min/avg/max/mdev = 185.877/194.105/203.103/6.517 ms
myshake@raspberryshake:/opt $ traceroute raspberryshakedata.com
traceroute to raspberryshakedata.com (65.21.129.27), 30 hops max, 60 byte packets
1 router.mainecoon.com (192.168.88.1) 9.453 ms 9.352 ms 9.448 ms
2 10.10.1.1 (10.10.1.1) 50.913 ms 51.324 ms 51.284 ms
3 157-131-42-161.dedicated.static.sonic.net (157.131.42.161) 62.937 ms 64.101 ms 64.062 ms
4 ae5.cr3.snrtcamn.sonic.net (157.131.242.202) 51.572 ms 51.521 ms 50.837 ms
5 ae2.cr4.snrtcamn.sonic.net (192.184.185.202) 141.459 ms 141.677 ms 141.637 ms
6 ae2.cr2.ptlmca01.sonic.net (192.184.185.177) 50.671 ms 36.180 ms 36.062 ms
7 * * *
8 * * *
9 * * *
10 * * *
11 * * *
12 * * *
13 100.ae1.nrd1.equinix-sj.sonic.net (75.101.33.185) 43.205 ms 43.842 ms 34.666 ms
14 sjo-b23-link.ip.twelve99.net (62.115.182.54) 42.788 ms 46.945 ms 46.612 ms
15 palo-bb2-link.ip.twelve99.net (62.115.139.116) 47.209 ms 46.782 ms 36.242 ms
16 den-bb2-link.ip.twelve99.net (62.115.139.113) 62.929 ms 63.426 ms 56.061 ms
17 kanc-bb2-link.ip.twelve99.net (62.115.137.115) 203.852 ms kanc-bb2-link.ip.twelve99.net (62.115.140.184) 203.810 ms kanc-bb2-link.ip.twelve99.net (62.115.137.115) 204.036 ms
18 chi-bb2-link.ip.twelve99.net (62.115.136.102) 196.457 ms 195.810 ms 195.499 ms
19 ewr-bb2-link.ip.twelve99.net (62.115.132.134) 99.700 ms 99.956 ms *
20 kbn-bb6-link.ip.twelve99.net (80.91.254.90) 183.318 ms 182.963 ms ldn-bb2-link.ip.twelve99.net (62.115.139.247) 195.951 ms
21 sto-bb2-link.ip.twelve99.net (62.115.139.172) 183.396 ms * 180.361 ms
22 hls-b4-link.ip.twelve99.net (62.115.123.203) 183.619 ms 179.798 ms sto-b9-link.ip.twelve99.net (62.115.139.187) 187.565 ms
23 hls-b6-link.ip.twelve99.net (62.115.137.134) 179.666 ms * *
24 hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 187.348 ms 187.034 ms 187.191 ms
25 hls-b6-link.ip.twelve99.net (62.115.137.134) 187.921 ms core31.hel1.hetzner.com (213.239.224.38) 191.815 ms hls-b6-link.ip.twelve99.net (62.115.137.134) 184.491 ms
26 ex9k2.dc3.hel1.hetzner.com (213.239.245.118) 188.306 ms hetzner-ic-376888.ip.twelve99-cust.net (213.248.95.243) 187.590 ms 185.122 ms
27 core31.hel1.hetzner.com (213.239.224.38) 192.984 ms * *
28 ex9k2.dc3.hel1.hetzner.com (213.239.245.118) 185.278 ms * *
29 * * *
30 * * *
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55555
Connection to raspberryshakedata.com 55555 port [tcp/] succeeded!
myshake@raspberryshake:/opt $ nc -zvw5 raspberryshakedata.com 55556
Connection to raspberryshakedata.com 55556 port [tcp/
] succeeded!
myshake@raspberryshake:/opt $ ntpq
ntpq> peers
remote refid st t when poll reach delay offset jitter

SHM(0) .GPS. 0 l 10 16 377 0.000 408.520 1.569
SHM(1) .PPS. 0 l - 16 0 0.000 0.000 0.000
*tick.mainecoon. .PPS. 1 u 170 512 377 0.292 0.992 2.624
+tock.mainecoon. .PPS. 1 u 541 512 377 0.274 -0.822 1.742
+usscz2-ntp-002. .GPSs. 1 u 60 64 377 22.750 2.338 8.135

At the moment, all three are up, all three are finding each other on the splash screen, but only the Boom’s data is being reported by Station View. Rebooting the 3D/4D will result in their data being reported (despite an NTP solution not yet being computed), but eventually they fall off. It’s a bit like the wire protocol doesn’t recover nicely from network interruptions.

As ever, thanks for the help!

Thank you for all the command-line reports, wiley42.

Our server team has checked things on our side, and I can report that there are no server-side issues at the moment. So, we need to focus on why the connection is behaving in this way.

As reflected client-side (your logs), our servers get many data transmission timeouts that can also be seen from all the broken pipeline lines in the logs. This means that the Shake is actively trying to transmit, but then the interruptions you’ve seen happen.

Let’s try this test by checking what happens when the Shakes are connectet singularly:

  • Disconnect the 3D and 4D, and leave the Boom on for 24h
  • Then disconnect it, and reconnect only the 3D for 24h
  • And, last, disconnect that and connect only the 4D for 24h

This is to check if the data volume that the three instruments are trying to update is behind what we are seeing. We already know that the Boom works fine, but it will be interesting to see the result of the experiment above.

Additionally, another element you can check (if you haven’t already) is this.

No trouble at all; you’re more than welcome!

Yep; interruptions are expected; the vault is located in mountainous terrain with visibility to the north partially obscured by trees and rising terrain, which means transient drop-outs, up to several minutes in duration, are normal. There’s a 40 meter tower waiting to be erected that should get the terminal above the trees (if only just), but there’s not much we can do about terrain but hope that build-out of the constellation eventually fills the gaps.

Currently, everything in the vault is down thanks to storms and the UPS eventually running dry. Everything is under two meters of snow, but clearing is expected soon, at which point we’ll excavate things and get back on the air – at which point we’ll run the proposed experiment.

While the Shakes are backhauling via Starlink, it’s not “naked” Starlink; they live on a RFC1918 network with a firewall at the edge that builds a tunnel via Starlink to a data center in a more -er- civilized part of the world. This gives us a way around the sometimes unexpected behavior of the Starlink terminal, and allows us to have a fixed external IP address.

I’ll be back once I have more data to share. As ever, thank you for the support!

1 Like

Quite understandable, thank you for all the additional details wiley42!

Probably, the additional height provided by the tower will address most of the issues, as the terminal should also be able to connect to more satellites (while waiting for the network up there to densify).

It’s no trouble at all. If you need further assistance when the snow melts, just reach out to us!