©XSIBackup-Free: Free Backup Software for ©VMWare ©ESXi

Forum ©XSIBackup: ©VMWare ©ESXi Backup Software



#1 Re: General matters » After restore of VM XSIBackup fails, cannot find the vmsd file » 2021-11-01 07:50:15

Yes. You are right. That is what I did eventually. I noted which disks were defined and which files they use, then copied just those and the VMX file to another directory. That fixed it.

#2 Re: General matters » After restore of VM XSIBackup fails, cannot find the vmsd file » 2021-10-29 12:30:18

OK. :) I also noticed a disk image there that has not been touched in months and does not seem to be defined in the VM properties. I guess I will make a backup of that file and remove it. This particular VM is an "appliance": you upgrade by moving the data disk to a new VM, so there is always a chance that you forget to delete some old files.

#3 Re: General matters » After restore of VM XSIBackup fails, cannot find the vmsd file » 2021-10-29 11:13:22

Thanks! No, I would never dream of renaming any other way than through the Web UI, and that takes care of renaming all the other files as well. Anyway, there were two files that retained the old name: that one and an XML file that just had <config /> in it. I shut down the VM, renamed those two and restarted. Backup is now running...

#4 General matters » After restore of VM XSIBackup fails, cannot find the vmsd file » 2021-10-29 06:25:50

AndersG
Replies: 6

XSIBACKUP-PRO 11.2.19

This time XSIBackup saved my bacon :) An upgrade that should have been trivial failed, so I had to revert to the backup VM on the backup server. I copied it to the primary server into another directory. The old VM was called Filr4_ny, and the copy that I now run, which runs without issues, is called Filr4_2021. The problem is that XSIBackup fails with:

cat: can't open '/vmfs/volumes/datastore1/Filr4/Filr4.vmsd
/vmfs/volumes/datastore1/Filr4/Filr4_2021.vmsd': No such file or directory


I.e., it is looking for the wrong file. I do not know where it picks up that name, so how do I correct it? I tried renaming the VM to match, but it did not help.


[root@ESXI34:~] ls /vmfs/volumes/datastore1/Filr4/
Filr4-Snapshot49.vmsn             Filr4_1-flat.vmdk                 Filr4_2021.vmsd                   vmware-1.log
Filr4-aux.xml                     Filr4_1.vmdk                      Filr4_2021.vmx                    vmware-2.log
Filr4-flat.vmdk                   Filr4_2-flat.vmdk                 Filr4_2021.vmx.lck                vmware.log
Filr4.nvram                       Filr4_2.vmdk                      Filr4_2021.vmx~                   vmx-Filr4_2021-3005619324-1.vswp
Filr4.vmdk                        Filr4_2021-aux.xml                vastorage-flat.vmdk
Filr4.vmsd                        Filr4_2021-b3261c7c.vswp          vastorage.vmdk
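The listing shows both Filr4.vmsd and Filr4_2021.vmsd in the same directory, and the cat error names both paths, one per line. That suggests the lookup matched every .vmsd file it found, although this is an assumption: the thread does not show how XSIBackup locates the file. A minimal sketch for spotting the situation before a backup run follows; list_vmsd and check_vmsd are hypothetical helper names, not part of XSIBackup.

```shell
#!/bin/sh
# list_vmsd (hypothetical helper): print the name of every .vmsd file
# found directly inside the VM directory given as $1.
list_vmsd() {
    for f in "$1"/*.vmsd; do
        [ -e "$f" ] && printf '%s\n' "${f##*/}"
    done
    return 0
}

# check_vmsd (hypothetical helper): warn when the directory holds more
# than one .vmsd file, since a VM normally needs only one.
check_vmsd() {
    n=$(list_vmsd "$1" | wc -l | tr -d ' ')
    if [ "$n" -gt 1 ]; then
        echo "WARNING: $n .vmsd files in $1"
        return 1
    fi
    echo "OK: $n .vmsd file(s) in $1"
}
```

Running check_vmsd /vmfs/volumes/datastore1/Filr4 on the directory listed above would be expected to report two .vmsd files.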

#5 Re: General matters » Backup problems, SSH not connecting after power outage - Solved » 2021-08-12 09:47:51

Thanks for that explanation, makes sense. Anyway, our ESX hosts are not exposed to the outside world. Only the hosts that need to be are, i.e. web servers and the like, and never on port 22.

#6 General matters » Backup problems, SSH not connecting after power outage - Solved » 2021-08-12 08:02:24

AndersG
Replies: 2

Solved. See below, but leaving it as it might help others.

XSIBACKUP-PRO 11.2.19 backing up VMs from one VMWare ESXi 6.7.0 build-14320388  server to another.

Yesterday we had a longish power outage, longer than the capacity of our UPS, so I shut down both VMWare servers through the console, just as a precaution. Today, the night's backup of ESXI34 to ESXI36 failed. The actual error message in the log is:


The server --backup-point=10.0.20.36:22:/vmfs/volumes/datastore1 does not exist
Make sure there is an SSH server listening on 10.0.20.36 port 22 and that the firewall allows this connection

Nothing, that I know of, has changed here and I can connect to .36 from other machines just fine, but if I try:

ssh root@10.0.20.36

On .34 it just times out. The same command from another machine works as expected.

[root@ESXI34:~] ssh root@10.0.20.36
ssh: connect to host 10.0.20.36 port 22: Connection timed out

Both machines are connected to the same switch.

I also noticed that SSH had mysteriously been disabled on ESXI36, so I re-enabled it. Could it be that the linking of the servers has been broken and I should reestablish that?


[root@ESXI34:~] /scratch/XSI/XSIBackup-Pro/xsibackup --link-srv=10.0.20.36

###############################################################################
#
# (c) XSIBACKUP-PRO 11.2.19 | Backup for (c) VMWARE ESXi Hypervisor by 33hops.com
#
###################################################################################

NOTICE: (c) XSIBackup kills any user launched jobs, make sure you don't overlap manual jobs
----------------------------------------------------------------------------------------------------------
XSIBackup PID:          2117809                                                ESXI34.pedago.fi
Enabling port 22 out in the ESXi firewall...
----------------------------------------------------------------------------------------------------------
There is already an RSA public key at /scratch/XSI/XSIBackup-Pro/xsibackup_id_rsa.pub. We will use the existing one.
----------------------------------------------------------------------------------------------------------
Enter the remote server root password when prompted.
----------------------------------------------------------------------------------------------------------
Enter remote systems's password for user root, checking OS type...
----------------------------------------------------------------------------------------------------------
The authenticity of host '10.0.20.36 (10.0.20.36)' can't be established.
RSA key fingerprint is SHA256:E8+cQmrqe4L9aTVZdQK2VjiY+aqymSETIBbUZqoo4Lc.
Are you sure you want to continue connecting (yes/no)? yes
Password:
Enter root password again to look for its authorized_keys file...
----------------------------------------------------------------------------------------------------------
Password:
Enter root remote password to iterate existing keys...
----------------------------------------------------------------------------------------------------------
Password:
Almost done, enter root password to add RSA key and restart SSH service...
----------------------------------------------------------------------------------------------------------
Password:
The RSA key has been added to the authorized_keys file at 10.0.20.36.
----------------------------------------------------------------------------------------------------------
Rebooting local SSH service...
SSH login disabled
SSH login enabled
Rebooting remote SSH service...
----------------------------------------------------------------------------------------------------------
Killed
[root@ESXI34:~]

Yes. That was it. Fixed! Something had happened that made the ESXi server lose its link.

#7 Re: General matters » Installation questions. » 2021-05-24 12:54:21

OK. Noted. FWIW, your software has worked flawlessly since September last year, but we will evaluate an upgrade.

#8 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-28 07:39:26

"Are you quoting paths with spaces?"

No, it is just that the VM in question had one disk whose name contained a space, and that was VMWare's doing, not mine. The name was automatically generated somewhere back in the VMWare 4.x days when the appliance was installed. Thus all I did was create a job to back up that VM.

I have been doing this for a very long time, since the CP/M days, and I would never myself put a space or accented characters in a filename, password or whatever, as it is a sure way of breaking things along the way :)


It could very well be what you say that this is a bug in that particular build of VMWare and it annoys the heck out of me that I did not spot that earlier. Would have saved me a lot of work.

#9 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-27 15:02:35

"(c)XSIBackup-Pro suppports spaces in file names."

And yet it would, on every second go, complain that the file was not there:

2020-12-25T18:14:03.596Z info hostd[2098563] [Originator@6876 sub=DiskLib opID=esxui-8168-138b user=root] DISKLIB-DSCPTR: DescriptorOpenInt: failed to open '/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk': Could not find the file (60002)

Instead of the two files, there would be just one, named "Novell.vmdk".

I could clearly see that it cut off the name after the space. 100% repeatable. Could it have been a limitation of rsync?

Anyway. It works after the rename.

#10 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-27 08:35:35

Oh, yes. To rename a VMWare disk file:
1. Shut down VM
2. Remove all snapshots
3. Note which disk file goes where in the VM
4. Remove the offending disk and all disks after. DO NOT DELETE FILES!
5. cd into the directory where the file is
6. vmkfstools -E "My Bad File.vmdk" MyBadFile.vmdk
7. Add files back in correct order

VMWare disks are always two files: file.vmdk and file-flat.vmdk. Renaming file.vmdk with vmkfstools -E will rename the flat file automatically.
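Step 6 of the procedure above can be prepared with a small sketch that only prints the vmkfstools -E commands and does not run them. suggest_renames is a hypothetical helper name. It assumes snapshots have already been removed (step 2), so the only companion file to skip is the -flat extent, and it assumes that simply dropping the spaces gives an acceptable new name, as in the "My Bad File.vmdk" example.

```shell
#!/bin/sh
# suggest_renames (hypothetical helper): for every descriptor .vmdk in
# directory $1 whose name contains a space, print a vmkfstools -E command
# that would rename it to the same name without spaces. Nothing is executed.
suggest_renames() {
    dir=$1
    for f in "$dir"/*\ *.vmdk; do
        [ -e "$f" ] || continue            # the glob matched nothing
        name=${f##*/}
        case $name in
            *-flat.vmdk) continue ;;       # extent file follows its descriptor
        esac
        new=$(printf '%s' "$name" | tr -d ' ')
        printf 'vmkfstools -E "%s" "%s"\n' "$name" "$new"
    done
}
```

Review the printed commands before running any of them from inside the VM directory, with the VM shut down and the disks detached as described in the steps.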

#11 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-27 08:23:34

OK. Here is the tl;dr summary:

One of the disk files in that VM had a space in the filename.

The long story:
The VM is an appliance called "Filr". It is a file sync/publishing appliance. A little bit like Sharepoint/OneDrive, but with the added benefit that files are in-house so you can be GDPR-compliant.

We have been using this for a long time and the VM probably started out running on VMWare 4. It has been upgraded several times, but from version 2 onwards you upgraded by installing a new appliance and adding the data disk from the old one. Thus, the data disk kept the name it was automatically given when the Filr2 installation was created: "Novell Filr-2_1.vmdk".

This worked fine for XSIBackup when running from disk A to B on the same host, but not from one host to another. The transfer ended up first transferring "Novell Filr-2_1.vmdk" as "Novell.vmdk" then came "Novell Filr-2_1-flat.vmdk" and overwrote as "Novell.vmdk".
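The thread does not establish where the name was cut. One common cause of this kind of symptom is a path that is expanded without quotes somewhere along the way, for example in a command that is parsed again by a remote shell. The sketch below only illustrates that general hazard and is not a reproduction of what XSIBackup or rsync does; count_args and first_word are hypothetical helpers.

```shell
#!/bin/sh
# count_args (hypothetical helper): report how many arguments it received.
count_args() { echo "$#"; }

# first_word (hypothetical helper): everything before the first space.
first_word() { printf '%s\n' "${1%% *}"; }

desc="Novell Filr-2_1.vmdk"
flat="Novell Filr-2_1-flat.vmdk"

count_args $desc      # unquoted: the shell splits the name at the space
count_args "$desc"    # quoted: the name stays a single argument

# Both names share the same first word, so any target name derived from
# that word alone would be identical for the two files.
first_word "$desc"
first_word "$flat"
```

The unquoted call sees two arguments and the quoted call sees one, and both file names reduce to the same first word, which is consistent with the second file overwriting the first on the target.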

My initial mistake when troubleshooting was that I looked at the hostd.log on the source, when I should have looked at the target. I misunderstood the wording of the error message: "ERROR DIFDELAL, details: [Filr4] error: error deleting all snapshots VM [Filr4_XSIBAK] Id 29, error: Remove All Snapshots". When I read it now I definitely see that it uniquely points to the target, but then again, hindsight is always 20-20.

Hope this helps someone else!

#12 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-26 17:43:33

I think I have found the core issue now. Been working on it all day. I will post back when I have tested a little bit more. If it is what I think it is, then it would explain why it worked when copying on the same server to another disk, but not between servers. And fail on just one VM at that.


Yes, the two ESXi hosts have identical versions. Can you elaborate on:

"You can't use Onediff between any two given ESXi hosts."

Do you mean: "The source and target ESXi hosts need to be the same version"?

#13 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-25 21:05:57

I strongly suspect that the issue is with one of the "disks" of that VM.

#14 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-25 18:21:40

OK. Same thing. When I look at the other VMs on the target server, they are all without snapshots, but the problem one (Filr4) has a snapshot on it, and it cannot be removed through the UI either. The error complains about the file Novell Filr-2_1-000001.vmdk: it is looking for a parent, Novell Filr-2_1.vmdk, but there is just a Novell Filr-2_1-flat.vmdk.

#15 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-25 09:28:15

So, if I look at hostd.log on the target, I definitely see things:

2020-12-25T09:15:33.297Z info hostd[2099162] [Originator@6876 sub=Libs opID=vim-cmd-2d-b471 user=root] OBJLIB-FILEBE : FileBEOpen: can't open '/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk' : Could not find the file (393218).
2020-12-25T09:15:33.297Z info hostd[2099162] [Originator@6876 sub=DiskLib opID=vim-cmd-2d-b471 user=root] DISKLIB-DSCPTR: DescriptorOpenInt: failed to open '/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk': Could not find the file (60002)
2020-12-25T09:15:33.298Z info hostd[2099162] [Originator@6876 sub=DiskLib opID=vim-cmd-2d-b471 user=root] DISKLIB-LINK  : "/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk" : failed to open (The system cannot find the file specified).
2020-12-25T09:15:33.298Z info hostd[2099162] [Originator@6876 sub=DiskLib opID=vim-cmd-2d-b471 user=root] DISKLIB-CHAIN :"/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1-000002.vmdk": Failed to open parent "/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk": The system cannot find the file specified.
2020-12-25T09:15:33.298Z info hostd[2099162] [Originator@6876 sub=DiskLib opID=vim-cmd-2d-b471 user=root] DISKLIB-CHAIN : "/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1.vmdk" : failed to open (The parent of this virtual disk could not be opened).
2020-12-25T09:15:33.298Z info hostd[2099162] [Originator@6876 sub=DiskLib opID=vim-cmd-2d-b471 user=root] DISKLIB-LIB   : Failed to open '/vmfs/volumes/5f97f8f4-7790c2f6-ea6b-74867aed9da0/Filr4/Novell Filr-2_1-000002.vmdk' with flags 0x3 The parent of this virtual disk could not be opened (23).


So, why this Filr4/Novell Filr-2_1-000002.vmdk? That file exists on the target but not on the source. Also present on the target are several Filr4-Snapshotnnnn files.

I also see the remove snapshot error on the target server in the UI. I will move the datastore folder on the target to another disk and retry the backup operation.

#16 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-24 13:44:44

But hey! Have I been looking in the wrong place?

[Filr4] info: deleting snapshot at [Filr4_XSIBAK] Id 29...
[Filr4] error DIFDELAL: error deleting all snapshots VM [Filr4_XSIBAK] Id 29, details: Remove All Snapshots:
Remove all snapshots failed


This is removing the snapshot on the remote copy, right?

#17 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-24 13:41:36

Tried first with the service shut down (in between preparing Christmas dinner). Fails on every second backup. Then turned off the VM completely. Still fails, so the error must be somewhere else?

#18 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-24 09:13:58

Thanks and yes. I have had too little time to troubleshoot this properly, but now I will have ample time over Christmas :) Anyway. Step one is to break out this backup as a separate job and run it at another time. That way it will be so much easier to spot differences. Will post my findings here.

So, have a very nice Christmas and a happy new year!

#19 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-23 12:50:05

The server in question is for file sync. It is only accessed by clients. I will start by breaking out the backup to a separate job at another time, which will make comparing logs easier. I will get back to you if I cannot solve this on my own.

#21 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-23 12:14:14

Well, I see the SoapAdapter.HTTPService error both for failed backups and for backups that run OK. Also, the fact that it happens exactly every other day is interesting. There is no activity that I know of at this time, but I could change the time and see if it makes any difference.

Would it help if I mailed you the log?

#22 General matters » XSI Pro to DC upgrade procedure » 2020-12-23 08:45:24

AndersG
Replies: 2

I am pondering whether to upgrade from Pro to DC and was wondering if there is any documentation on the actual procedure? I.e., can existing jobs be used or migrated to DC?

#23 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-22 17:21:52

I could really use some help here. It is absolutely repeatable. It fails every other day. Are there any other logs I could look at? In hostd I see an entry "Task Created : haTask-24-vim.VirtualMachine.removeAllSnapshots" on the days when it works, but not on the days it does not. On days when it fails, the first access for that VM seems to be "Task Created : haTask-24-vim.ManagedEntity.reload"
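Comparing good and bad days by hand gets tedious, so here is a rough sketch that pulls the "Task Created" entries out of a hostd log and prints them with their date. tasks_by_day is a hypothetical helper, and it assumes every line begins with an ISO timestamp as in the hostd excerpts quoted in this thread.

```shell
#!/bin/sh
# tasks_by_day (hypothetical helper): print "<date> <task>" for every
# "Task Created" line in the hostd log given as $1. Assumes each line
# starts with an ISO timestamp such as 2020-12-25T09:15:33.297Z.
tasks_by_day() {
    awk 'match($0, /Task Created : haTask-[^ ]*/) {
        # 15 is the length of the "Task Created : " prefix
        print substr($1, 1, 10), substr($0, RSTART + 15, RLENGTH - 15)
    }' "$1"
}
```

Run against the hostd.log of each host, for example tasks_by_day /var/log/hostd.log, the output makes it easier to see on which days removeAllSnapshots appears and on which days only reload does.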

#24 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-12 12:22:03

OK. When looking at hostd.log and comparing to good days, I see:

2020-12-12T05:37:04.627Z error hostd[2099433] [Originator@6876 sub=SoapAdapter.HTTPService] Failed to read request; stream: <io_obj p:0x000000868a48d8d0, h:-1, <TCP '127.0.0.1 : 8307'>, <TCP '127.0.0.1 : 36575'> FD Closed>, error: N7Vmacore16TimeoutExceptionE(Operation timed out: Stream: <io_obj p:0x000000868a48d8d0, h:-1, <TCP '127.0.0.1 : 8307'>, <TCP '127.0.0.1 : 36575'> FD Closed>, duration: 00:00:47.668509 (hh:mm:ss.us))
--> [context]zKq7AVICAgAAAASD2gALaG9zdGQAACxONmxpYnZtYWNvcmUuc28AALC+GwCQnBcAQcwaAPjRLAAVaSkAteUoAJPpKACrxDYBO30AbGlicHRocmVhZC5zby4wAAJ9nw5saWJjLnNvLjYA[/context]
2020-12-12T05:37:09.698Z error hostd[5224387] [Originator@6876 sub=Default] [LikewiseGetDomainJoinInfo:354] QueryInformation(): ERROR_FILE_NOT_FOUND (2/0):

#25 Re: © OneDiff » Backup of one VM fails, every other day. » 2020-12-01 13:14:55

Well, I posted the relevant snippet from the log earlier :) Anyway. The error might be related to something stopping ESXi from quiescing the machine. I will investigate further.

And yes, now that we have narrowed it down, I can troubleshoot further.
