Just crunching the numbers out loud.
Sound travels roughly 1,100 feet per second +/- 100 feet per second depending on factors (temp / air pressure). Rough guessing it here, but should be within 10% or 20% of the real world numbers.
4' interruption / reflection point. So 4/1100 seconds till sound hits the wall, assuming that it originated from the other wall. So real world-ish (center room) more like 2/1100 Although 4/1100 for the second bounce. Fastest time wise, the corners are different timing / distance wise (aka muddy). a^2+b^2=c^2, or 4^2+4^2=distance^2, or 16+16=distance^2, or distance = 5.65-ish feet aka square root of 32. Corner to corner.
So at what Hz would a full wave (center to peak to center to valley and back to center) be the length of the distance wall to wall? 1100/4 or 1100/2 or 1100/5.65 or 1100/2.825 or
275 Hz or 550 Hz or 195 Hz or 389 Hz. Lowest guy there is 195 Hz, just below one octave below tuning A(220Hz), with A(440Hz) below middle C being tuning A. I hope you don't play Tuba or String Bass. 60Hz which is that audible hum we get from some gear on A/C power is well below that too.
Basically not an ideal space. But we already knew that. What could be done? Seal and pressurize it to change the speed of sound? Just how long until your client suffocates though? Heat the space to a couple hundred F as in Fahrenheit (or freeze it to 0 kelvin)? Likely well beyond the specs of the gear, and probably not much better for the client either. Without changing the space, you pretty much need to throw away all frequencies below 200Hz. Which leaves what? Tap dancers and castanet players to record in your space? Or Fran Drescher.
Not that you can't record those lower frequencies (which aren't all that low) in that space. But it's going to sound off to the performer. Who might have to force out what might otherwise come naturally in a different space. Or be so alienated from what they're used to, to not even be able to perform at all. I had an experience like that once. Karaoke night at a club and the monitors were in phase reversal of the mic. I could hear me coming out of the bad sounding speakers, but I couldn't hear me coming out of my own mouth. Freaky weird that first time out. And that was without the aid of libations.