Ok, direct answers to direct questions

.
Hardware.
1)You'll need a good computer with as much CPU, memory and hard drive as you can afford.
2)You'll need a multi-channel line input device to get the audio into the computer. I recommend
the MOTU 1224 or 2408 core kit which will give you a PCI card and an 8-in 8-out analog input device that also has some digital connections (depending on which model you go for). You can add up to two more 2408 or 1224 devices that connect to the same PCI card. There are other good devices, just search around on these forums.
3) You'll need some kind of mic preamp to create line signals to send to the MOTU (or other) input device. That's where the mixer comes in. I like using a full mixer for the versatility, but you could use just dedicated mic preamps. If you go for a mixer you'll want to make sure it has direct line outs for each channel, which will plug into the input device to the computer. I use
an Alesis Studio 32.
4) If you get powered monitors you won't need an amp.
If you get a good, high quality input device you shouldn't have any pops or clicks or driver conflicts. The installation of my MOTU 1224 went without a hitch and it's performed flawlessly for months now. If you don't put any other odd hardware in the machine there shouldn't be a problem.
I think the delay you mentioned is going to be there no matter what type of hardware you buy. In the case of the MOTU devices you can turn down the buffer setting and eliminate the delay, but you're better off adjusting in software. All the multitrack recording software I've used so far allows you to put in a "latency" or delay setting that compensates for the delay caused by the hardware buffer. The hardware buffer is there for a good reason, so it's best (I think) to take care of it that way. It may be a pain in the ass to set up the first time (figuring out the amount of delay and adjusting to eliminate it), but I think the one-time inconvenience is worth it.
Hope that helps.